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Introduction 

In "Probability for Finance" (Roger, 2010) we presented the essential 
tools from probability theory which are useful in one-period financial mod- 
els 1 . We assume here that the reader is comfortable with these elements of 
probability theory. This second book is an extension to multi-period models, 
either in the discrete or continuous-time framework. 

When dealing with multi-period models, one of the key problems is to 
take into account the revelation of information over time, especially the in- 
formation transmitted by the observation of economic variables like prices, 
interest rates or exchange rates. We already referred to this problem when 
conditional expectations were developed in "Probability for Finance". This 
technical tool will be used extensively in the chapters of this book. 

As mentioned before, there are several approaches to study multi-period 
models, depending on the way time is measured. Roughly speaking, financial 
models can be divided in two families. In discrete-time models, markets are 
open on a finite or countable set of dates, denoted 0, 1, ..,T. In continuous- 
time models, markets are always open and the set of dates is an interval 
[0; T] . 

These two categories have their own advantages and drawbacks. Discrete- 
time models are easier to understand and sometimes allow to solve valuation 
problems that cannot be easily managed in continuous-time. This is the case 
for the valuation of American options. However, continuous-time models 
often provide simple analytical solutions (also called closed-form solutions) 
when discrete-time models only provide untractable solutions and/or bulky 
formulas. 

It is also clear that discrete-time models use a less sophisticated mathe- 
matical machinery and make easier economic interpretations. It is the rea- 
son why chapter 1 starts with the presentation of discrete-time stochastic 
processes. A section is devoted to Markov chains which are common tools, es- 
pecially in credit risks models. A particular subset of discrete-time processes, 
namely martingales, is especially important in finance, leading to devote a 
large part of this chapter to these processes (in their discrete-time version). 

x For example in portfolio choice models like the one developed by Harry Markowitz 
(1952, 1959). 
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The second chapter addresses continuous-time processes. Financial mod- 
els started to use these tools at the end of the sixties 2 . On a mathematical 
point of view, they are more demanding and more difficult to understand. In 
some cases, they reveal "strange" objects like continuous and nowhere dif- 
ferentiable functions. One more time, the subset of continuous martingales 
is of primary importance in valuation models but one particular process, 
called "Brownian motion" or "Wiener process" is the main building block of 
a broad category of processes used in financial models. Consequently, a non 
negligible part of this second chapter is devoted to the study of the Wiener 
process. 

Chapter 3 introduces stochastic calculus. When managing a portfolio in 
continuous-time, one has to combine quantities of assets with price variations 
to calculate the return of the portfolio. In discrete-time, it is simply written 
as a sum (over the set of dates) of products (quantities times price variations) 
and aggregated over the different stocks in the portfolio. In continuous-time, 
it is technically more difficult to perform these calculations. This problem is 
solved by using stochastic integrals. 

Moreover, a usual problem in finance is to determine the dynamics of 
prices of derivative securities (like futures contracts or options), knowing the 
dynamics of the underlying asset of the contract. It can be achieved by using 
Ito's lemma, which can be seen as a Taylor series expansion for stochas- 
tic processes. It is constantly used in continuous-time valuation models of 
derivative securities. 

Finally, it can be shown (in arbitrage-free pricing models) , that the today 
price of a security is its expected tomorrow price 3 discounted at the risk- 
free rate, the expectation being calculated under a so called risk-neutral 
probability measure. It means that in the risk-neutral world, risky assets do 
not deliver a risk premium. But they deliver such a premium in the real world. 
Therefore, a mathematical tool is needed to express the dynamics of a security 
price in the risk-neutral world, knowing the corresponding dynamics in the 
real world. Girsanov theorem is the right tool to perform this transformation; 
it is presented at the end of chapter 3. 



2 See Merton (1969, 1971) 

3 We assume that no dividends are paid between today and tomorrow. 
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Chapter 1 

Discrete-time stochastic 
processes 

1.1 Introduction 

In one-period models, the only uncertainty is about the state of nature oc- 
curing at date T, trading being realized only at date 0. There is neither 
learning by investors nor information disclosure at intermediate dates for the 
simple reason that there are no intermediate dates. 

In this chapter, we introduce a multi-period market in a simple frame- 
work. It is not the most general context since the set of relevant dates is 
assumed finite. Continuous markets will be described in the next chapter. In 
section 1.2 we introduce stochastic processes in this constrained framework. 
Section 1.3 addresses the question of information revelation over time. The 
good understanding of this section is essential, not because it is technically 
difficult, but because it describes the formal model of information revelation 
and the assumptions commonly used in multi-period financial models. In 
section 1.5 we enter the core of the subject by presenting martingales and 
their use in financial models. 

Three questions are addressed. 

1) The Doob decomposition of a martingale, useful to decompose the 
return on a financial asset in two components, a predictable one and a zero- 
mean "surprise" 1 . 

1 This term has a precise mathematical translation but for now the reader can keep in 
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2) We show that in a market without arbitrage opportunities, there 
doesn't exist a dynamic strategy allowing to "beat the market". 

3) We prove that even if one chooses the date of entry and the liquidation 
date in a smart way, one cannot beat the market. The financial gain/loss 
of the strategy has the same characteristics as the one obtained using the 
standard buy and hold strategy. 

Obviously, the results obtained in (2) and (3) are based on disputable 
assumptions, like market perfection, which are not always satisfied on real 
markets. However, this approach provides solid foundations to the theory 
of financial valuation. Deviations occur temporarily but, in most cases, the 
activity of arbitrageurs bring back prices toward the theoretical values. 

1.2 The general framework 

As mentioned in the introduction, we consider a discrete-time economy with 
a relevant set of dates denoted asT = {0, 1, T} where T < +oo. The latter 
assumption is not mandatory but finite-horizon models cover the major part 
of the financial literature. The financial market is open at dates in the set 
71 = {0,1,...,T — 1}; the last transactions are realized at date T — 1 and 
the securities pay a liquidating dividend or a terminal payoff by date T. It is 
equivalent to assume that agents consume their remaining wealth at date T 
and die just after! 

Uncertainty is described by a probability space (O, A, P) where fl is the 
set of states of nature, A is a tribe on Q and P is a probability measure on 
A. As the market is open at different dates, prices and returns are sequences 
of random variables indexed by time. Such a sequence is called a stochastic 
process. 

Definition 1 A discrete-time stochastic process is a sequence of random 
variables X = (X 0 , ...,Xt) defined on (Q,A) and taking values in 2 K. 

This definition may seem restrictive because variables X t are real random 
variables. We could also assume that they are n-dimensional random vectors 

mind the usual meaning of this word. 

2 The set R is always equipped with the Borel tribe Br. 
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but it will not be necessary in the present chapter. In the same way, index- 
ation could be done by any ordered set but it is of little interest in financial 
models where time is the natural index. 

The variables X t can be either discrete or continuous. For example, when 
the binomial distribution of stock prices was presented in the first book, it 
was referred to a stochastic process of stock prices X written as: 

X t = X t _x x Y t 

where the variables Y t were taking values u and d with probabilities p and 
1 — p. When a positive initial value X 0 is defined arbitrarily, X is a sto- 
chastic process and the X t are discrete random variables (with finite support 
in this case). On the contrary, if the X t were assumed to be lognormally 
distributed, the variables would be continuous because they could take any 
value in R + . So it is important to distinguish what is discrete. There ex- 
ist discrete(continuous)-time stochastic processes with discrete (continuous) 
variables. The time dimension and the state dimension are clearly different. 
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State 


X x {u) 


X 2 (u) 




100 


95 


Co> 2 


100 


106 


U 3 


105 


102 


CJ4 


105 


109 



Table 1.1: Definition of X\ et X 2 

1.3 Information revelation over time 
1.3.1 Filtration on a probability space 

The definition of a stochastic process shows that the time-dimension must 
be explicitly included in the description of a financial market and in the 
sequence of prices. Specifying the distributions of the X t is not sufficient 
to describe what really happens on the market. In particular, even if X\ 
and X 2 (a stock price at two successive dates) follow the same probability 
distribution, the value of X\ is known at date 1 when the value of X 2 is not 
yet revealed. Moreover, it is important to note that all the variables in the 
process (whatever their time-index is) are defined on the same space Q. 

To illustrate the point in a simple framework, consider Q = {oji, % = 1, 4} 
and A = V(Q); assume that X\ and X 2 are defined by table 1.1. 

Observing Xi at date 1 provides some information 3 . For example, X\ = 
100 reveals that the event {oj 1,012} occurs. On the contrary, observing X\ = 
105 means that the pair {003, u^} occurs. Two remarks can be done at this 
stage. First, the conditional probabilities related to the possible values of 
X 2 are changed after the observation of X\. If {ui,u 2 } is true, only prices 
95 and 106 remain possible at date 2. Second, even if observing X\ reveals 
information, uncertainty remains because the terminal date (T = 2) has not 
yet been reached. We do not know exactly at date 1 what will be the value 
of X 2 at date 2. 

Obviously this example is overly simplified. But the definitions of X\ and 
X 2 seems to possess reasonable properties when comparing to real markets. 
Agents accumulate information over time but uncertainty remains about fu- 
ture prices (at least for risky assets). We let the reader check that the 

3 This type of example was already studied in the chapter of "Probability for Finance" 
dealing with conditional expectations. 
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description of X\ and X 2 is relevant because the a-algebra generated by X\ 
is strictly included in the cr-algebra generated by the pair (X 1 , X 2 ) . 

To build a consistent financial model, it is then necessary to choose a 
relevant definition of information revelation over time. In particular, at a 
given date t, agents observe date-t prices which are no more random at 
future dates. Moreover, it seems reasonable to assume that the quantity of 
information held by investors increases over time (corresponding to the idea 
that investors don't forget anything!). 

In technical terms, these points can be summarized as follows. The list 
of events you know to be true or false at a given date s is included in the 
corresponding list at a future date t. It also means that if an event is true at 
a given date it will also be true at any future date. Filtrations are the right 
mathematical tool to translate these ideas. 

Definition 2 A filtration on a probability space (Q, A, P) is an increasing 
sequence T = {To, Ti, Tt} of sub-tribes of A. The quadruple (Q, A, P, T) 
is called a filtered probability space. 

Increasingness of tribes T is understood here as: 



for any t > 1. 

In a model ending at date T, it is reasonable to assume that no uncertainty 
remains at the terminal date T. Consequently, it is often assumed that Tt = 
A. In what follows, we always consider this assumption as satisfied without 
recalling it systematically. 

In the same way, we assume that nothing is known at date 0, that is 
To = {0;fi}. Despite the fact you know nothing, you are able to say that 
the impossible event doesn't occur and that the sure event occurs. 

In the example of table 1.1, the relevant filtration is: 



To 
Pi 
T 2 



{0,fi} 

{0, {uj 1 ,uj 2 } , {uj 3 ,u±},tt} 
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At the intermediate date t — 1 you are able to say if u 2 } or {u; 3 , u; 4 } 
occurs because you observe X\ which brings the necessary information. Con- 
sequently, at date 0, you can describe the events you will know to be true or 
false at future dates. The reader has to be conscious that it is a strong as- 
sumption. In the real life (and especially on financial markets), many events 
occur which were even not imagined by investors before they occured 4 . 

1.3.2 Adapted and predictable processes 

The information carried by a stochastic process X is increasing over time 
and we need to specify what is meant by "agents do not forget" and "X t 
is not random after date t". Consider that the future price of IBM stock 
is a stochastic process, that is a sequence of random variables. At date t, 
investors observe the stock price X t and X t will not change at date t + 1 (it 
will be X t+ i at this future date). It means that X t is random up to date t 
and becomes a number (or a constant random variable) after date t. 

If T = {J-'oyFi, ...,JF T } describes the evolution of information over time, 
T t is the date-t list of events you know as true or false. In particular, you 
know if, for example, {X t < 100} is true or not, where X t is the IBM stock 
price. It simply means that X t is measurable with respect to Tt (the same 
remark applies to any date t). 

Definition 3 1) A stochastic process X = (X 0 , Xt) is adapted to the 
filtration T if, for any t, X t is J r t -measurable. 

2) The natural filtration of a stochastic process X is the sequence T x of 
sub-tribes of A such that is the tribe generated by the variables X s ,s <t. 

Consider one- more time the Cox- Ross- Rubinstein model (1979) with X t = 
X t -iY t , the Y t being independent and taking values u and d with probabilities 
p and 1 — p. If T = 2, there are four possible paths for X between t = 0 and 
T = 2. Then, 4 states are necessary to describe the evolution of prices. We 
note Q = {uu, ud, du, dd}; at date 1, X\ is observed and we know if the price 
lies on a path starting by an up move or a down move. As mentioned before, 
the events {uu, ud} and {du, dd} are known to be true or false by that date. 

4 Probably, the number of investors who were considering the bankruptcy of Lehman 
Brothers as possible was very low, one year before it occured in September 2008. 
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T\ = {0, {uu, ud} , {du, dd} , Q} is then the relevant tribe to describe the 
information known at date 1. X\ is defined by: 

X\(uu) = Xi(ud) = uXq 
Xx(du) = X 1 (dd) = dX 0 

This variable is ^-measurable and in fact T = T x . This is a general 
remark; when only one risky asset is traded on the market, the natural filtra- 
tion of the price process is sufficient to modelize the information revelation 
in the economy 



Turning a challenge into a learning curve. 

Just another day at the office for a high performer. 

Accenture Boot Camp - your toughest test yet 

Choose Accenture for a career where the variety of opportunities and challenges allows you to make a 
difference every day. A place where you can develop your potential and grow professionally, working 
alongside talented colleagues. The only place where you can learn from our unrivalled experience, while 
helping our global clients achieve high performance. If this is your idea of a typical working day, then 
Accenture is the place to be. 



It all starts at Boot Camp. It's 48 hours 
that will stimulate your mind and 
enhance your career prospects. You'll 
spend time with other students, top 
Accenture Consultants and special 
guests. An inspirational two days 



packed with intellectual challenges 
and activities designed to let you 
discover what it really means to be a 
high performer in business. We can't 
tell you everything about Boot Camp, 
but expect a fast-paced, exhilarating 



and intense learning experience. 
It could be your toughest test yet, 
which is exactly what will make it 
your biggest opportunity. 

Find out more and apply online. 

> 

accenture 

High performance. Deliverec 



Visit accenture.com/bootcamp 

Consulting • Technology • Outsourcing 



CD 
> 

CO 
CD 



O 

"5 

CD 
CO 
CC 
CD 



Download free ebooks at bookboon.com 

15 



Stochastic Processes for Finance 



Discrete-time stochastic processes 



State 


X 0 




X 2 {uj) 


UJl 


1 


u 


u 2 


UJ 2 


1 


u 


ud 




1 


d 


du 


CJ4 


1 


d 


d 2 



Table 1.2: Definition of the price process X 

When Q is finite with Card(Q) = n, any random variable can be identi- 
fied to a vector with n components. The space L 2 (Q, A, P) is itself identified 
to K n . Moreover, L 2 (Q, Ft, P) , the space of J^-measurable random variables 
is a vector subspace of L 2 (Q, A, P) (see the geometric interpretation of con- 
ditional expectations in the first book). 

Obviously, the dimension of this subspace increases with t. In the former 
example, L 2 (f2,jF 1; P) was two-dimensional, characterized by: 

L 2 (fi, T\, P) = {(x, y, z, t) G M 4 such that x = y and z = t} 

In other words, the tribe T\ separates neither the first pair of states, nor 
the second pair. Table 1.2 shows the entire price process. To simplify, it is 
assumed that X 0 = 1 in table 1.2. 

Consider now an investor taking a position on a market where K assets 
are traded, whose price processes are denoted as X k , k = 1, K. 

He takes a position at date t and rebalances his portfolio at date t + 1. He 
still holds the date-t portfolio at date t + 1 just before the rebalancing. We 
then need to choose notations, either Q' t = (6], ■■■0 t ) or 6' t+1 = (9 t+1 , ■■■0f +1 ) 
for the portfolio held between t et t + 1. 

In discrete-time, the two choices are equivalent but, to keep the vocab- 
ulary consistent with the choices to be made in continuous-time in next 
chapters, it is preferable to select the second notation. However, one has to 
remark that 6 t+ i is revealed at date t. This corresponds to what is called a 
predictable process. 

Definition 4 A stochastic process X = (X 1 ...,X T ) is predictable if, for any 
t > 1, X t is Tt-\-measurable. 

The essential interest of predictable processes will appear in the section 
devoted to martingales. The process in definition 4 starts with an index t = 1 
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since X\ is JF 0 -measurable. When a process 9 denotes quantities of stocks, 
one can argue that the agent may already possess some stocks at date 0, as an 
initial endowment. In this case, it is convenient to denote 6q this endowment, 
knowing that the first transactions realized at date 0 will transform 6q in 9\. 

Example 1 Other stochastic processes than quantities are naturally pre- 
dictable. Consider a market on which it is possible to invest at each date 
and for one period in a locally risk-free asset (savings account); let r t be the 
rate of return on this account on the time-period [t; t + 1] and B t the amount 
obtained at date t by a rolling investment of one dollar in the savings account 
since date 0. 

Recall that r t is known at date t and the process r is adapted to the 
filtration JF, explaining the expression "locally risk-free" (only risk-free one 
period ahead). The process B is then predictable. In fact, it can be written: 

t-i 

B t = + ^ 

s=0 

B t is known at date t — 1, then B t is Tt-\-measurable. 

1.4 Markov chains 
1.4.1 Introduction 

Rating agencies like Moodys, Standard & Poor's or Fitch, regularly publish 
statistics about the evolution of the ratings of a number of financial instru- 
ments. From time to time they publish tables like the one on figure 1.1. The 
first column with elements identified from AAA (the best grade) to CCC 
(the worst grade before default) provides possible grades given by the rating 
agency, on a given year p, to given financial instruments like corporate bonds. 

The first line gives the possible states of the rating process on year p + 1 . 
There are two more elements in this line. "D" means default and "NR" 
means "not rated". The numbers in the matrix provide the proportion of 
firms moving from a rating to another one. For example, 88.46% on the top 
left means that 88.46% of bonds rated AAA on year p are still rated AAA 
on year p+ 1. The number just on the right is 8.05. It signifies that 8.05% of 
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AAA 


AA 


A 


BBB 


BB 


B 


CCC 


D 




AAA 


88 46 


805 


072 


006 


011 


0 00 


000 


000 


261 


AA 


0 63 


8827 


7.47 


0.56 


0.05 


0.13 


0.02 


0.00 


287 


A 


0.06 


232 


87.64 


5.02 


0.65 


0.22 


0.01 


0.05 


4.01 


BBB 


003 


029 


5.54 


8249 


4.68 


1 02 


011 


0.17 


567 


BB 


002 


0.11 


0.58 


7.01 


73.83 


7.64 


089 


0 98 


8.93 


B 


000 


0 09 


021 


0.39 


5.98 


7276 


342 


4.92 


1223 


CCC 


0 17 


000 


034 


1.02 


220 


9.64 


5313 


19.29 


1421 



Figure 1.1: One- year ratings migration 



the bonds rated AAA on year p have been downgraded to AA on year p + 1 . 
This kind of matrix is called a transition matrix or a migration matrix. 

On a mathematical point of view, these transition matrices are linked to 
a category of stochastic processes called Markov processes. They are defined 
as follows. 
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1.4.2 Definition and transition probabilities 

Definition 5 A process X defined on (fi, A, P, J 7 ) is a Markov process if 

for any (B\, B n ) G B^ and any [ti, t n ) G T n such that t\ < .... <t n : 

P (X tn G B n \X t . G Bj,j = 1, .., n - l) = P (X tn G B n \X tn _, G B n _ x ) 

The Markov property says that the history of the process up to date 
does not matter to determine what happens between t n _i and t n . The only 
important information is to know where is located the process at date t n -\. 

If the state space is finite, the set of values taken by the variables (X p ,p G N) 
is denoted (xi, ...,x n ) and Markov processes have specific properties. They 
are called finite Markov chains. 

Definition 6 Let (X p ,p G N) a finite Markov chain taking values in (xi, x n ) ; 

1) Transition probabilities of order 1 are the quantities: 

n(xi,p- l,p,Xj) = P(X P = xj \X p -i = Xi) (1.1) 

2) The matrix Tl p -i = (n(xi,p— l,p,xj),i,j = 1, ...,n) is the transition 
matrix of the Markov chain at date p — 1. 

2) When the transition probabilities do not depend on p, the Markov chain 
is said homogeneous (or stationary) and notations are simplified by writing 

ir(Xi,p- l,P,Xj) = TTij 

n(xi,p — l,p,Xj) is the probability for the process X to be in state Xj 
at date p knowing it is in state x^ at date p — 1. These probabilities are a 
specific case of the probabilities appearing in definition 5. 

If two lines (corresponding to "D" and "NR") are added to the transition 
matrix in figure 1.1 (with ones as diagonal terms and zero otherwise), the 
resulting square matrix is the transition matrix of a Markov chain with 9 
states. 

1.4.3 Chapman-Kolmogorov equations 

Figure 1.1 showed the one-year transition probabilities. But it is also impor- 
tant to estimate the transition probabilities for 2 years, 3 years and, more 
generally m years. For example, a bank lending money to a firm for 5 years 
is interested in the 5- year default probability. 
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For a homogeneous Markov chain, the Chapman-Kolmogorov equations 
allow to deduce the m-year transition probabilities from the corresponding 
one- year transition probabilities. 

Proposition 1 Let X = (X p ,p G N) denote a N -state homogeneous Markov 
chain with one-period transition probability matrix n = = 1, ■■-,N) . 

Let TTiJ 1 = P(X m+n = Xj \Xq = Xi) denote the m + n-period transition 
probability from state i to state j. 



4r n) =Ei =1 ^ m) 4? (i.2) 



(2) 

To keep things simple, suppose m = n = 1 and N = 3. For example, 7r 12 ; 
is the probability for the process to be in state X2 at date 2, knowing the 
date-0 state was x\. The graph below shows the 3 possible paths with the 
corresponding transition probabilities. 

Til rp 7T12 

/ 1 \ 

1"12 7T22 

Xi — > x 2 — > x 2 



\ 

7T13 



x 3 



We then deduce: 



7T^? = 7Tn7ri2 + Vri 2 7r 2 2 + ^13^32 (1.3) 



More generally, the Chapman-Kolmogorov equations "count" the possi- 
ble paths from Xj to Xj inm + n steps and simply cumulate the probabili- 
ties of all these paths. However, the formulation is very interesting because 
^(m+n) . g j nner product of the i-th row of the m-period transition ma- 
trix (^i^, i,j = 1, Tij with the j'-column of the n-period transition matrix 
yK^\i,j = 1, ...,nj . We then have the useful property. 

Corollary 1 Let X = (X p ,p G N) denote a N -state homogeneous Markov 
chain with one-period transition probability matrix n = = 1, ■■-,N) . 

Let nlj 1 ^ = P(X m = Xj \Xq = Xi) denote the m-period transition probability 
from state i to state j and tt^ the m-period transition matrix containing the 

(m) 
7T-- . 

13 

7T (m) = 7T m (1.4) 

where n m is the m-th power of the one-period transition matrix ir. 
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1.4.4 Classification of states 
Accessibility and communication 

Definition 7 1) A state Xj of a Markov chain X is accessible from a state 
Xi if there exists an integer k > 0 such that: 

> 0 (1.5) 

2) Two states x, and Xj of a Markov chain communicate if there exist 
integers k and k' such that: 

Trg 0 > 0 and nf. ] > 0 (1.6) 

Example 2 Consider the following matrix ir. 



0.6 


0.4 


0 


0 


0.3 


0.7 


0 


0 


0 


0 


0.5 


0.5 


0 


0 


0.7 


0.3 



(1.7) 



It can be observed that when the process starts in state x% or x 2} states x 5 
and £4 are never reached; they are not accessible. We can also see that x\ 
and X2 communicate. The same is true for x 3 and X4. When starting from 
one of these two states, x\ or X2 are not accessible. In fact, for this specific 
chain, there exists 2 "closed" classes {x\ 1 X2\ and {£3, £4} . 

The following proposition generalizes the above remark. 

Proposition 2 Let X = (X p ,p G N) denote a N -state homogeneous Markov 
chain with one-period transition probability matrix n = = 1, ■■■,N) . 

There exists a partition (C\,C2, ■■■Cm) of the N states such that two states 
communicate if and only if they are in the same class 5 . The sets Ck are called 
communicating classes. 

When looking at figure 1.1, we easily see that the set of ratings AAA to 
CCC communicate even if some transition probabilities are equal to 0. But 
the state D corresponding to default does not communicate with the others. 

5 The relation 1Z, defined by Xi 1Z Xj iff x% and Xj communicate, is an equivalence 
relation (reflexive, symmetric and transitive). 
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The implicit assumption is that a defaulted instrument never recovers. It 
does not mean that D is not accessible from the other states. In fact, the 
column "D" on figure 1.1 contains non zero entries. For example, a corporate 
bond rated CCC has a probability 19.29% to default on the next year. D is 
not in the same communicating class as the other states because reaching D 
prevents to come back to an other state. 

The state NR is particular because we don't know if a non rated financial 
instrument will be rated in the future. However, a simplifying assumption is 
to consider that NR instruments at a given date will stay NR in the future. 

Definition 8 A Markov chain is said irreducible if all states communicate, 
in other words there is only one communicating class. 

Periodicity 

Consider a binomial model for the evolution of a stock price S defined for 
any date t + 1 by 



A usual calibration for this model is to assume d = 1/u. In this case, if 
the initial price So is $100, you are sure that the price cannot come back to 
$100 in less than two periods. 

Definition 9 Let X = (X p ,p G N) denote a N -state homogeneous Markov 
chain with one-period transition probability matrix n = (n^, i,j = 1, N) . 

1) The periodicity of a state Xi, denoted t(i) is the greatest common 
divisor of numbers m such that Ti^ m \i^i) > 0. By convention, t(i) = 0 if for 
any m, ir( m >(i, i) = 0. 

2) A Markov chain X is aperiodic if t{i) = 1 for any i. 

In the binomial model viewed as a Markov chain, the periodicity of states 
is equal to 2. 

Definition 10 Let X = (X p ,p e N) denote a homogeneous Markov chain 
with one-period transition probability matrix n. 




uS t with probability p 
dS t with probability 1 — p 



(1.8) 
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1) Letpi(n) denote the probability of coming back to state i after n periods. 
A state i is recurrent if 

E+oo 
. 1 ft(n = 1 (1-9) 
i=i 

2) A state which is not recurrent is said transient. 

3) A state i is said positively recurrent if 

lim 7ri m) = qi > 0 (1.10) 

TO^+OO 

The chain comes back almost surely on any recurrent state after a first 
passage. When the state i is positively recurrent, the transition probability of 
coming back to state % in n states never vanishes, even when the length of the 
transition period tends to infinity. When all states are positively recurrent, 
the chain is called positively recurrent. 
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1.4.5 Stationary distribution of a Markov chain 

Proposition 3 Let X = (X p ,p e N) denote an aperiodic, positively recur- 
rent, homogeneous Markov chain. We get: 



E+oo 
.7=1 



(1.11) 



E+oo 
qi = 1. Moreover, the probabilities qi are uniquely determined by 
i=l 

the following relationships: 



Vz, qi > 0 

E+oo 
. . ft = 1 
1=1 

E+oo 
,=1 



7T; 



(1.12) 
(1.13) 
(1.14) 



TTie probability measure by the qi on the states of the chain is called the 
stationary distribution of the Markov chain. 

In fact, for a finite Markov chain, looking for the stationary distribution 
of the chain consists in calculating the powers 7r m of the one-period transi- 
tion probability matrix. When m tends to infinity, the lines of 7r m become 
identical. Each line represents the probability measure (qt,i = 1, ...,n). 

Example 3 Consider the very simple case of a two-state chain characterized 
by the following transition matrix. 



7T 



0.6 0.4 
0.4 0.6 



1.15) 



This chain obviously satisfies the assumptions of proposition 3. The suc- 
cessive powers of n give: 



7T 



" 0.52 0.48 " 




0.48 0.52 


vr 3 = 



0.504 0.496 m _ 0.5 0.5 

0.496 0.504 J and m ll > + 1 oo 7r ~ [ 0.5 0.5 

(1.16) 

The stationary distribution of the chain means that the probability of being 
in one state on the long run does not depend on the initial state. 
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1 . 5 Mart ingales 

We first recall the essential properties of conditional expectations, already 
presented in chapter 4 of Roger (2010). 

Proposition 4 Let (Y, Z) be two random variables in L 2 (O, A, P) and B, B' 
two sub-tribes of A satisfying B C B' ' : 

1) If Z is a constant c G K, E (Z \B) = c 

2) V(o, b) G M 2 , E {aZ + bY \B) = aE (Z \B) + bE (Y \B) 

3) IfZ< Y, E(Z\B) <E{Y\B) 

4) E{E{Z \B') \B) = E{Z \B) (law of iterated expectations) 

5) IfZ is B-measurable E (ZY \B) = Z E (Y \B) 

6) If Z is independent of B, E{Z \B) = E(Z) 

We can now define the stochastic processes known as martingales. 

Definition 11 a) Let (Q,A, F,P) a filtered probability space; a stochastic 
process X = (X 0 , ...,X T ) is a (J 7 , P) -martingale if: 

i) X is adapted to T 

ii) Vt G T, X t G L 1 (Sl,A,P) 

iii) VteT*, X t . 1 = E[X t \^ 1 ] 

b) X is a (JF, P)-supermartingale if (Hi) is replaced by X t -\ > E [X t {J^t-i} 

c) X is a (JF, P)-submartingale if (Hi) is replaced by X t -i < E [X t \ J-t-i] 

In this definition, we specify (J 7 , P)-martingale because being a martin- 
gale depends simultaneously on the filtration and the probability measure as 
shown in (i) to (iii). 

In the following, we will simply write "martingale" when no confusion is 
possible or P-martingale when we want to specify the probability-measure. 

Point iii) of the definition may also be written E [X t — X t -i \ J~"t-i] = 0 
because X is JF-adapted and X t -\ is JF^-measurable (see (4) of proposition 
4). 



25 



Download free ebooks at bookboon.com 



Stochastic Processes for Finance 



Discrete-time stochastic processes 



Remark 1 Using the law of iterated expectations, it is obvious to see that 
if X is a martingale (supermartingale, submartingale) we have, for any pair 
(s, t), s < t: 

E [X t \F s ] = (<,>) X s 
This relationship could also be used equivalently in point (Hi) of definition 

11. 

Just look at E[X t+ i \!Ft~i\ • Using definition 11, we know that X t _i = 
E [X t l^t-i] and X t = E [X t+ i \!F t \ . It implies 

X t _ 1 = #[£[X* fl |.F t ]|.F t _ 1 ] (1.17) 

Using now point (4) of proposition 4 leads to 

X t _ 1 = E[X t+1 \F t _ 1 ] (1.18) 

because T t -\ C J-'t+i- 

The properties of conditional expectations show that, if X is a square in- 
tegrable martingale, E [X t l^t-i] is the best JF^-measurable approximation 
of X t (in the OLS sense). Consequently, for a martingale X, X t -\ is the best 
approximation of X t conditioned on the information known at date t — 1. In 
geometrical terms, Xt-i is the orthogonal projection of X t on the subspace 
of J-i_i-measurable variables. 

This kind of process is then very well fitted to modelize "fair games" 
because E(X t ) is constant 6 . 

The most standard examples of martingales are the random walk (for 
example the gambler's wealth in a repeated fair game) and the Doob mar- 
tingale. They are described hereafter. 

Definition 12 A random walk is a stochastic process X such that X 0 = c G 
M and: 

X t = X t _x + Y t 
where the variables Y t are independent. 
6 Properties of conditional expectations lead to 

E (E [X t \T t -i })=E [X t ] = E [Xt-i] 
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Example 4 A gambler starts a fair game at date 0 with a wealth X 0 ; his 
date-t wealth X t is defined by: 

X t = X t _x + Y t 

where Y t is the gain/loss of the t-th draw. The Y s are independent and zero- 
mean (insuring the fairness of the game) random variables. 
Equivalently, the date-t wealth can be written: 

t 

X t = X 0 + Y,Ys 

8=1 

Assume that T is the natural filtration of the process Y, that is T s is the 
tribe generated by Y u , u < s. As Y s is the result of the s-th draw and X s the 
gambler's wealth after s draws, X is J 7 -adapted and we get: 



£[X t |.F t -i] = ElX^ + Yl^} 

= ElX^l^ + ElYl^} 

X being T -adapted, the first term on the RHS is equal to X t _i; moreover, 
the variables Y t are independent of each other, implying that Y t is independent 
ofFt-i- H follows that E [Y t {F^] = E [Y t ] = 0 E [X t = X t _ h then 

proving that X is a martingale. 
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Example 5 Doob's martingale 

Let Y G L 1 (0, A, P); define the stochastic process X as: 

X t = E[Y\T t ] 

X is obviously a martingale. Using the law of iterated expectations, it 
follows immediately 

EiXtl^] = E[E[Y\T t \\^t-i\ 
= E[Y\F t - 1 ]=X t - 1 

However, it is worth to mention that a process with a constant expectation 
(E [X t ] independent of t) is not always a martingale. The following example 
illustrates this point 

Example 6 Consider an urn containing an even number of balls T, half of 
them being white, the other half being black. At each date t < T a ball is 
randomly drawn without replacement. Let Y (resp. Z) the stochastic process 
counting the number of white (black) balls having been drawn. The relevant 
filtration is the natural filtration ofY (or equivalently Z) . 

Let X t = Y t — Z t the difference between the number of white and black 
balls after t draws. The stochastic process satisfies X 0 = 0 and we can write: 

X t = X t _ x + 6 t 

with St = 1 if the t-th ball is white and S t = — 1 if it is black. Consequently: 

E[X t \T t - 1 ]=X t _ 1 + E[5 t \F t -i] 

The essential difference between this process and the random walk is that 
S t is no more independent of T t -\- In fact, assume thatY t = s and Z t = t — s. 
We then get: 

T _ 

P(8 m = l\Y t = a ) = Zr^ 

This quantity is different from | as soon as s ^ |, and in this case, 
E[8 t+1 \Y t = s}^0. 

It shows that X is not a martingale. However, for any t, E(Y t ) = E(Z t ) = 
| leading to E [X t ] = 0. X is then an example of a constant mean stochas- 
tic process which is not a martingale. Obviously, this feature comes from 
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the "without replacement " characteristic of the draws. After each draw, the 
probability of drawing a white (black) ball in the next draw is changed. But 
seen from date 0, it is a fair game since the mean number of white(black) 
balls drawn between 0 and t is equal to | . 

It is worth to note that the "problem " comes from the fact that the ter- 
minal value of Y and Z is perfectly known at date 0. In fact, before the first 
draw, one already knows that Yt = Zt = \ implying Xt = 0. In technical 
terms, these three variables are T^-measurable. 

1.5.1 Doob decomposition of an adapted process 

The result stated hereafter shows that martingales come naturally in the 
description of any adapted process. 

Proposition 5 Let X be a stochastic process adapted to T, each X t being 
integrable. X may be decomposed in the following way: 

Vi G T, X t = X 0 + M t + A t (1.19) 

where M is a martingale satisfying M 0 = 0 and A is a predictable process 
such that A 0 = 0. If X is a sub-martingale, A is increasing (A t < A t+1 ) 

Proof. If X t = X 0 + M t + A t , we can write: 

E [X t - X t . x \F t - X ] = E [M t — M t -! \F t -i] + E[A t — A t - X \T t -x\ 

As M is a martingale, the first term on the RHS is 0. A being predictable, 
the second term is equal to A t — A t -\. Summing over the time-index leads 
to: 

t 

A t = Y,E[X s -X s _!\T s _ x ] 

s=l 

(We check on this equation that if X is a sub-martingale, A is increasing). 
Defining M as follows: 

* 

M t = X t - X 0 - E [X s - AV X \F S _ X \ 

s=l 

gives the desired result. ■ 



29 



Download free ebooks at bookboon.com 



Stochastic Processes for Finance 



Discrete-time stochastic processes 



The Doob decomposition 2.6 has a simple economic interpretation. Let 
Xt be the cumulative return on a financial asset between 0 and t. The decom- 
position means that in each period [s; s + 1], the return has two components: 
a predictible one and a zero-mean "surprise" (the martingale part). In fact: 

X t - X t _! = M t - M t _! +A t - A t _ x 

It leads to: 

E [X t - X t _, |.F t _i] =E[A t - A t _ x \T t _ x \ = A t - A t _ x 

In an economy populated by risk-neutral agents, A t — A t -\ would be the 
variation linked to the (locally) risk-free rate. 

1.5.2 Martingales and self-financing strategies 

Consider a market where K stocks are traded. Their date-t prices are denoted 
as X t = pfj 1 , ....,X^). They are measured in units of a numeraire. Assume 
that the price processes X k are martingales and denote 6 t = {0\ , Of) the 
portfolio held by an agent between t — 1 and t. The date-t portfolio value is: 

K 
k=l 

To be able to compare V t {6) and Vo(6>) in a meaningful way, it is necessary 
that no additional funds are invested and no funds withdrawn at intermediate 
dates s, 0 < s < t. Moreover, prices at different dates have to be measured 
in the same unit (called the numeraire 7 ). This remark allows us to introduce 
the notion of self-financing (or self-financed) strategy. But first consider the 
example described in table 1.3. There are three dates and two stocks. The 
initial endowment of the investor is 5 units of stock 1 and 10 units of stock 
2. The initial value of the position is then 5 x $100 + 10 x $60 = $1100. 

At date 1, before transactions, the investor still holds the same quantities 
and the value of the portfolio is 5x$130+10x$65 = $1300. After transactions, 
we see in the table that he holds 6 units of stock 1 and 8 units of stock 2. 

7 In most common cases, these prices are measured in date-0 or date-T monetary units 
depending on the choice between discounting and capitalization. 
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Quantities 


Prices (in $) 


Position 




Date 


Stockl 


Stock2 


Stockl 


Stock2 


Value (in $) 




0 


5 


10 


100 


60 


1100 




1 


6 


8 


130 


65 


1300 




2 


4 


11 


120 


80 


1360 





Table 1.3: Self-financing strategy S 



It means that he bought one unit of stock 1 for a cost of 130 and sold 2 
units of stock 2, then receiving 130. It turns out that the purchase of stock 
1 is exactly financed by the sale of stock 2. Obviously, when calculating the 
portfolio value at date 1 after transactions, we get 6 x $130 + 8 x $65 = $1300, 
that is the same value. 

This is the intuition of a "self-financing" strategy. If you want to hold 
more units of stock 1, you have to sell the equivalent amount of stock 2. We 
let the reader check that, at date 2, the strategy is still self-financing. 
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Definition 13 a) A portfolio strategy is a bounded predictable process 9. 
b) A strategy 9 is self- financing if, at any date t: 



K 



K 



(1.20) 



k=l 



k=l 



The LHS is equal to the date-t liquidation value of the portfolio 9 t . The 
RHS of equation 1.20 is the cost of the new portfolio 9 t+1 built at date t. 
Therefore, the equality means that funds are neither added to, nor withdrawn 
from, the strategy. On a practical point of view, it means that buying stocks 
at an intermediate date must be financed by selling other stocks already in 
the portfolio. 

The following proposition shows that if price processes are martingales, 
the value process of any self-financing strategy is also a martingale. 

It means that you cannot beat the market by picking stocks when price 
processes are martingales, or equivalently, that you cannot transform a fair 
game into a favorable game. 

Proposition 6 Let X = (X 1 , ...,X K ^j be a martingale taking values in M. K 
and 9 = (9 1 , ■ ■■,9 K ) a self -financing strategy; the process Y defined by: 



K 



In 



Y, 



k=i 



fc=i 



is a martingale. 

Proof. Remark first that Y t is the date-t value of the strategy 9, Yq being 
the initial cost of the portfolio 



E[Y t -Y t . x \F t -i 



E 



K 



(*W - oUxti) 



k=l 



The self-financing hypothesis allows to write: 



E 



K 



k=l 
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because £f=i = £Li e" t+1 X*. 

As 6 is predictable, 6 t can be put outside the conditional expectation in 
the last term. It is the same for the summation term since the expectation 
operator is linear. Consequently: 

K 

E [Y t - Y t -\ |^_x] = °tE [{Xt - Xt x ) \F t _ x ] 

k=l 

But X is a martingale, so the conditional expectations on the RHS are 
equal to 0. We finally get: 



K n k 



or equivalently: 



£[y t -y t _i|.F t _i] = 
E[Y t \^ x ] = y t _i 



o 



Proposition 6 has a "financial smell" but it is a consequence of the fol- 
lowing mathematical result. 

Proposition 7 Let X = (X 1 , X K ^j a martingale taking values in M. K and 
9 = (6 1 ,..., 9 K ) a bounded predictable process; the stochastic process Z defined 
by: 



K t 



k=l s=l 



is a martingale. 

Proof. It is sufficient to write: 



E [Z t — Z t -i \F{ 



t-i 



E 



K 



fc=i 



The same arguments as in proposition 6 can be used to get: 



E [Z t - Z t _, l^i] = 0 tE [{X? - Xt,) \^] 



k=l 
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and the conditional expectation is 0 because X is a martingale. ■ 

The definition of Z in proposition 7 gives an idea of what will be the 
stochastic integral described in chapter 3. To show the analogy with the usual 
Stieltjes integral, remember the way expectations of continuous variables are 
defined. Let Y be a random variable with density /y and CDF Fy- 

/+oo P+OO 
y fv(y) dy= y dF Y (y) 
-oo J — oo 

The expectation is in fact the Stieltjes integral of the identity function 
with respect to the CDF. This formulation is linked to the preceding chapter 
by writing E(Y) = E(Y1q) = (Y, 1q) L 2 • Consequently, 1q represents (as in 
the Riesz theorem) the expectation operator which is linear. 

Suppose now that Y is a discrete random variable taking values y% < 
2/2, ••• < y n ] we then have: 

/+oo 
y dF Y {y) 
-oo 

One can note that this formulation is close to the one used in the definition 
of Z which could be written: 

k r t 

Z t = J2 W 
k=i Jo 

Each integral J 0 * 9 k s dX k is the gain/loss of strategy 9 on asset k between 
0 and t. Without entering the details now, there is an essential difference 
between E(X) and Z t ; E(X) is a real number but Z t is a random variable. 
Moreover the CDF of Y is increasing but there is no reason for X k to be 
increasing. These differences are "heavy" and impose some precautions in 
defining the stochastic integral. 

1.5.3 Investment strategies and stopping times 

A second important question concerning investment strategies is the follow- 
ing: is there a way to select optimally the liquidation date of the portfolio? 
The intuition is simple. If a position is liquidated as soon as it generates 
a positive profit, it is surely a gain. The answer to the question is yes if 
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the time-horizon of the investor is infinite and if he can bear any level of 
intermediate losses! In other cases (that is on real markets populated by real 
agents), the answer is no. 

The most simple example is a heads or tails game where 1$ is won (tails) 
or lost (heads) each time the (fair) coin is flipped. Successive results are 
assumed independent. 

Let X t be the gambler's wealth at date t; it is defined by: 

t 

s=l 

where the Y s are random variables taking values 1 and -1 with equal proba- 
bilities (corresponding to gain or losses), X 0 being the initial wealth of the 
gambler. 

Consider the strategy in which the player stops gambling at the first date 
t such that X t = X 0 + 1. This strategy is clearly a winning one because 
the gambler ends the game with X$ + 1 with probability 1. However, to 
get such a result, any intermediate loss must be acceptable and an infinite 
time-horizon is necessary. If the gambler knows he has to stop after, say, T 
games, he cannot be sure to end with a profit. 

These two conditions don't seem reasonable (especially on financial mar- 
kets). A finite horizon and bounded intermediate possible losses prevent to 
transform a fair game into a favorable game, as we will see now. The notion 
of stopping time arises naturally to formalize this kind of situation. 
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Definition 14 A stopping time is a random variable v defined on (0, A, P) 
taking values in T such that: 

\/t G T {v = t} = {u G Q such that v{u) — t} G T% 

This definition may, at a first glance, seem very abstract but it is easily 
interpretable if v is seen as a date at which a decision must be taken (typically, 
liquidating a portfolio for example). With this view in mind, {v = t} G 
Tt simply means that liquidating the portfolio at date t can be decided 
only using past and present information. In other words, saying that the 
liquidation date is a stopping time means that you don't read in a crystal 
ball (revealing the future) before taking the decision. 

To give more intuition, assume that you send the following order to your 
broker: "buy 100 IBM stocks at the minimum price between now and the end 
of the month". The execution date is a random variable but to know that 
the price on the 25th day of the month reaches the minimum, you must know 
the prices on the following days! Consequently, the execution date is not a 
stopping time (in technical words it is JF T -measurable, not JF t -measurable) . 
Obviously, no broker would accept such an order. 

Remark that in discrete-time models with a finite Q, it is equivalent to 
define a stopping time by replacing {v = t} in definition 14 by {v < t} , or 
equivalently by {v > t} . The reason comes from the definition of a filtration. 
As T t -\ C Tt-, {v = t — 1} G Tt-i and so {v = t — 1} G T t - This reasoning 
can be done with any date t — k with k < t. Finally, as T t is a tribe, {v < t} G 
T t {v < tf = { v >t}e T t . 

The Doob's optional stopping theorem answers the question asked 
before concerning the existence of an optimal liquidation date. 

Proposition 8 Let v be a stopping time and X a martingale; if one of the 
following conditions is satisfied, then X v is integrable and E(X V ) = E (X 0 ) . 

1) v is bounded 

2) X is bounded, that is there exists K such that for every t G T and 
every u G Q, \X t (ui)\ < K. 

This proposition is interpreted as follows. Point (1) says that if the num- 
ber of draws is finite, the expected gain is zero. Point (2) means that, if X t 
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denotes the date-t wealth of the agent, resulting from investments in financial 
assets when prices are martingales, the expected wealth at the liquidation 
date v is equal to the initial wealth, as soon as wealth is bounded in absolute 
value at any date. In other words, if you are not ready to bear arbitrary large 
intermediate losses, you cannot beat the market, even by choosing intelli- 
gently a liquidation date. It is interesting to note that a purely mathematical 

result has such an intuitive financial interpretation. 

Obviously, in the abovementioned heads and tails game, conditions (1) 
and (2) are not satisfied if the game is pursued as long as a gain of 1 unit is 
not reached. 

To illustrate the point, define the stochastic process Y by: 

f X t if t < v 
* \ X v ift>v 

Y is called the stopped process of X, sometimes denoted X v where v is a 
bounded stopping time. The following result is then obtained. 

Proposition 9 If v is a bounded stopping time and X a stochastic process 
adapted to JF, the stopped process Y is also adapted to T . Moreover, if X is 
a martingale (super-martingale), it is also the case for Y. 

Proof. Let £ s = l{ v > s y, this variable is valued 1 when {v > s} occurs and 0 
otherwise. Y can be decomposed as follows: 

t 

s=l 

In fact, if t < v, the indicator functions in the sum are equal to 1 and, 
consequently Y t = X t . On the contrary, if t > v the indicator functions are 
equal to 1 for s < v and 0 beyond. We then get as expected Y t = X v . As 
{v > t} = {v < t — 1} C it follows that the process £ is predictable implying 
that Y is adapted. 

Moreover, if X is a martingale (super- martingale), Y is also a martingale 
because £ is predictable.In fact we can directly write: 

E[Y t -Y t ^\^ t ] = E[i t {X t -X t ^)\^ t ] (1.21) 
= i t E[{X t -X t _ x )\^ t ] (1.22) 
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X is a martingale, therefore 

E[(X t -X t _ 1 )\T t ] = 0 (1.23) 

This proof in fact uses some of the properties of conditional expectations 
recalled in proposition 4. ■ 

Example 7 A doubling strategy 

A little bit more sphisticated strategy is to double the stake after each 
losing draw and to stop the game after the first win. Suppose, to keep things 
simple, that the gambler starts with X 0 = 0 and bets one unit on the first 
draw. 

If t unfavorable draws occur in a row, the gambler's wealth is: 

t t-i 

8=1 S=0 

We recognize the sum of the t first terms of a geometric sequence. We 
then get: 

2* — 1 

X t = = -(2* - 1) 

2-1 v ; 
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The stake for the t + 1-th draw is 2*. In case of winning, the final wealth 
is then Xt+i = 1 = X 0 + 1. 

Observe that the expectation of the stopping time v — inf {t / X t = 1} is 
finite because the mean number of draws is: 

= £ (*) . 

8=1 V 7 

It can be rewritten as: 

+oo / j\ +°° +°° / J 

^ (^r) = implies: 

+oo 1 
s=l 

However, v does not satisfy conditions of theorem 8 because it is not 
almost surely bounded. In the same way, X is not almost surely bounded 
because the doubling strategy can generate incredibly high intermediate losses 
(if you are especially unlucky!), a situation that cannot be easily dealt with 
on real financial markets^ . 



1.5.4 Stopping times and American options 

An American (European) put option with maturity T and strike price K is 
a contract giving the right to his holder to sell a given underlying asset like 
a stock, an index or a currency, at a given price K, at any date before T (at 
date T). 

Denote Y t the payoff received by the holder if he exercises the put at date 
t.We get If = max(fr — S t ;0), S t being the date-i price of the underlying 
asset. Obviously, we assume that investors never exercise their option at 
date t when K < S t . It would be irrational to sell the underlying asset for K 
when you can sell it for a higher price on the market. 

8 Think to the cases of LTCM, Lehman Brothers, etc. 
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Moreover, the decision to exercise early is taken by comparing K — St to 
the value of the option if not exercised. To value this kind of option, the 
notion of "Snell envelope" of a stochastic process is very useful. 

Definition 15 Let Y be a stochastic process adapted to T ; the process X 
defined by : 

f X T = Y T 

\ X t = max (Y t ; E (X t+1 \F t )) si t < T 
is called the Snell envelope of Y. 

We then get the following proposition. 

Proposition 10 The Snell envelope XofYis the smallest super-martingale 
greater than Y, that is satisfying X t > Y t for any t. 

Proof. From the definition it follows immediately X t > E (X t+ i |jF t ) ; X is 
then a super-martingale. Let Z be a super-martingale greater than Y. We 
are going to show that Vt, Z t > X t . This relationship is true for t = T 
because of the definition of X. Let us use backward induction by assuming 
that Z s > X s for s > t 0 and proving that Z to -i > X to -i. 

As Z is a super-martingale, we can write : 

Z t0 ^ >E(Z t0 > E(X t0 (1.24) 

The second inequality comes from the recurrence assumption. Moreover 
Z is greater than Y; consequently: 

Z t0 -i > Y t0 _t (1.25) 

Inequalities 1.24 and 1.25 imply: 

Z to -i > max (y t0 _i; E (X to |.F to _i )) = X to _i 

Using backward induction, we get the same result for any t. 
■ 

The optimal exercise date of the American put will be the first date t at 
which X t = Y t . 
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Proposition 11 Let v the random variable defined by: 

v = M(t / X t = Y t ) 
v is a stopping time and the stopped process X v is a martingale. 
Proof. The definition of X implies v < T . We can also write: 

s-l 

{ v = s } = p| {X u > Y u }f]{X s = Y s } 

u=l 

Each of the events appearing on the RHS is in T s since the variables 
X u , Y u , u < s are JF s -measurable; consequently {v = s} G T s and v is a 
stopping time. 

X% may be decomposed using the variables £ s = 1{ V > S }, defined in propo- 
sition 9, as follows : 

t 

X? = X„ + X£ - 

s=l 

Using X s = max (Y s ; E (X s+1 |.F S )) we deduce X^(u) = X s (u) for u G 
{v > s} , which leads to: 

X s (u) = E (X s+1 \F S ) (oj) pour uj G {v > s} 

It is then sufficient to prove that X v s - X v s _ x = £ s (X s - E (X s \T s -x)) . 
But if {v > s} , the LHS is equal to X s — X s _i and the RHS too by definition 
of X s . On the event {v < s} , the LHS is 0 because X" = X^_ t = Y v . It is 
also the case for the RHS because the indicator function is 0 on the event 
{v < s}. 

We finally get: 

E {X: - XU |^ s _a ) = E {i s (X s - E {X s \F S _ X )) \F 8 __ X ) 

£ being predictable, ^ s goes out the conditional expectation which is then 
equal to 0. We have shown that: 

e(x:-x:_ 1 \f s „ 1 ) = o 

that is X v is a martingale. ■ 

To conclude this section, the following proposition shows that the stop- 
ping time v is the solution of the early exercise problem for American put 
options. 
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Proposition 12 In 9 the set of T -valued stopping times, denoted as T, the 
stopping time v defined by: 

v = w£{t/ X t = Y t } 
solves the optimization problem: 

E[Y V ] = sup E[Y U ] 

The interpretation of this result for American options is clear. The ran- 
dom variable Y t = m&x(K — S t ; 0) is called the intrinsic value of the option 
at date t. The difference X t — Y t is called the speculative value of the put 
which is always positive or equal to 0. The above proposition means that it 
is optimal to exercise an American put as soon as its speculative value is 0. 
When early exercise is done at the optimal date, the price process of the put 
option (in numeraire units) is a martingale. 



'For the proof, see for example Bingham-Kiesel (1998) ,p 80. 
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Chapter 2 

Continuous-time stochastic 
processes 

2.1 Introduction 

The discrete-time processes presented in chapter 1 are easy to understand 
because they are managed without complex mathematical tools. Moreover, 
when the number of states of nature is finite, elementary algebraic properties 
can be used to make easier the understanding. The counterpart is that no 
"beautiful" analytical formula is obtained, and sometimes it may be difficult 
to take into account the way real markets work (in continuous time). 

The price to pay to enter the continuous-time world seems large to many 
students who do not have a mathematical background. All vector spaces 
are infinite dimensional, price paths are functions of time with non intuitive 
properties. For example, they may be everywhere continuous and nowhere 
differentiable. 

In this chapter, we first present the general definition of a continuous- 
time process which is not much different of the equivalent notion in discrete- 
time. We then describe the properties of the paths commonly encountered in 
the financial literature, and the concepts of filtration, adapted process and 
predictable process as well. 

The next section is devoted to Markov and diffusion/ltd processes. The 
definition of Markov processes in continuous-time is the same as the one given 
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in discrete-time, except that the set of dates is different. Diffusion and Ito 
processes introduce more conditions on the paths. 

The most important stochastic process in financial models is the Brownian 
motion (also called Wiener process). We hope that our presentation will look 
intuitive to the reader. The final section studies the fundamental properties 
of the Brownian motion, especially those linked to martingales, very useful 
in finance. 

2.2 General framework 

The basic notations are the same as in the preceding chapter. (f2, A, P) is 
a probability space. The set of relevant dates is still denoted T but now 
T = [0; T] , T < +00. It means that markets are open in continuous-time. 

Definition 16 A stochastic process is a family of random variables X = 
(X t , t G T) taking values in (R, . 

As mentioned in the introduction, the definition is identical to the one 
given in discrete-time but the set of indices T is different. In particular, it 
is not countable. 

Definition 17 The mapping t — > X t (u) for a given u G Q is called a path 
(or trajectory) of the process X. 

In discrete-time, a path of a stochastic process is simply a sequence of 
values taken by the process on the (discrete) set of dates. Here a path is a 
function defined on an interval [0; T] of the real line. 

Three kinds of paths are usually encountered in financial models. 

• The description of prices or interest rates dynamics commonly assumes 
that paths are continuous functions. It is the case in the Black-Scholes 
(1973) option pricing model for the dynamics of the underlying asset of 
the option contract. It is also the assumption for the short-term rate 
in the models of Vasicek (1977), Cox- Ingersoll- Ross (1985) or in the 
model of Heath- Jarrow-Morton (1992) for forward rates. 
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Figure 2.1 illustrates this continuity assumption. It represents the evo- 
lution of daily closing values of the S&P500 index from 1950 to May 
2010 1 (more than 12 000 points). We observe that the path of the index 
is represented as a continuous function, even if it seems to have many 
points where the function t —>■ X t (u) is not differentiable. 



1800 




12/04/1949 25/05/1960 08/03/1971 18/02/1932 31/01/1993 14/01/2004 27/12/2014 



Figure 2.1: S&P500 daily values from 1950 to 2010 

• In some cases, it is assumed that paths are cadlag, meaning (in French 2 ) 
"continues a droite et possedant une limite a gauche". The English 
translation is right-continuous with left limit (RCLL). In fact, con- 
tinuous paths are not always the right tool to describe some economic 
variables, especially when jumps are possible. For example, it is now 
well known that stock returns are not normally distributed because ex- 
treme returns occur much more frequently than what is predicted by 
the Gaussian distribution hypothesis. 

Introducing discontinuities in paths may be a way to describe this phe- 
nomenon (see Merton (1976), Cox-Ross (1976), etc.), for example by 

1 Dates are in the format JJ/MM/YY on the graph 

2 We provide the French expression because the acronym cadlag has become standard 
in many books or papers, even in English ones. 
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using Poisson processes (see definition 22). Figure 2.2 shows the daily 
returns of the S&P500 index in October 2008. We observe a contin- 
uous curve simply because successive values have been joined, but it 
is largely artificial. Daily variations are very large (around f 0% in ab- 
solute value on some days 3 ) and it could be more relevant to take jumps 
into account to represent the evolution of the index. 
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Figure 2.2: S&P500 daily returns in October 2008 



3 October 2008 was a "crazy" 



month, partly because it followed the bankruptcy of 
Lehmann Brothers in September. Many other difficult periods were to follow. 
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• Paths of some stochastic processes may also be caglad ("continues 
a gauche et possedant une limite a droite"), that is left-continuous 
with right limit (LCRL). This assumption is relevant to describe 
the quantity of a given asset held by an investor. Obviously, portfolio 
rebalancing occurs in discrete-time. Choosing left continuity instead of 
right continuity is based on the following idea. When a path is left- 
continuous, X t (u) is known as soon as the X s (u),s < t are known. 
In other words, if you know the composition of your portfolio at any 
past instant, you also know it now. Remember what we wrote in the 
preceding chapter about predictable processes for quantities. The idea 
is the same here and we will come back to this point later on. 

Continuous-time stochastic processes are a little bit more complicated 
when the state-space is not countable. In fact, there may be non-empty but 
negligible subsets of states of nature and also subsets of dates with Lebesgue 
measure zero on T . The following definition is then useful to manage this 
technical problem. 

Definition 18 1. A process Y is called a modification of X if for any 

t G T , the set Q t = {X t = Y t } has probability 1. Y is also called a 
version of X. 

2. Two processes X and Y are indistinguishable if they have in common 
"almost all " their paths; more precisely: 

3tt* e A such that P(Q*) = 1 and e fi*, Vt G T, X t (u) = Y t (cu) 

These two notions, though seemingly close, are different. Let Q = [0; 1] , T = 
[0; 1] and X defined by X t = 0 for any t and any uj. Let now Y be defined as: 

Y t (cu) = ltit = u 
= 0 elsewhere 

It is obvious that P ({X t = Y t }) = 1 when P is the Lebesgue measure 
on fi. Y is then a modification of X. On the contrary, X and Y are not 
indistinguishable because f] t Q t = 0. We cannot find two identical paths! 

We have here an illustration of the technical difficulties evoked in chap- 
ter 1 of "Probability for Finance" (Roger, 2010) when defining probability 
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measures and tribes. The problem is that a countable intersection of sets of 
probability 1 is also a set of probability 1. It is not true when the intersection 
under consideration is uncountable. 

These are only technical difficulties but we have to mention them to be 
rigorous. However, in financial models, we are rarely concerned with such 
questions when dealing with real life problems, simply because continuous- 
time models are an idealization of what is really happening on financial mar- 
kets. 

2.2.1 Filtrations, adapted and predictable processes 

The notion of filtration is still relevant in continuous-time models but some 
more conditions are needed to ensure tractability. 

Definition 19 1. A filtration on Q is an increasing family J 7 = {Tt, t G T} 
of sub-tribes of A. The quadruple (Q, A, T, P) is called a filtered prob- 
ability space. 

2. A filtration T is right- continuous if for any t < T, Tt = f] s>t T s . 

3. A filtration T is complete is any tribe T t contains all negligible events. 

4- A process X is adapted to a filtration T if for any t, X t is T t -measurable. 

5. The natural filtration of a process X, denoted as T , is the smallest 
tribe with respect to which X is adapted. Tf is the tribe generated by 
the variables X s ,s < t. 

When Q is finite, it is generally assumed that P(uj) > 0 for any state u. 
When Q is uncountable, assuming complete tribes is a convenient assumption 
for purely technical reasons. Without this assumption, we could imagine that 
some set B e Tt satisfies P{B) = 0 but a subset A of B is not in Tt- The 
definition of a probability measure implies that P(A) < P(B) when A C B 
but P(A) would not be defined if A was not an event! 

In what follows, we always assume that the filtrations we refer to are 
right-continuous and complete, even if this assumption is not recalled. These 
two assumptions are sometimes called (with some sense of humor) "usual 
conditions". 
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In discrete-time, a predictable process has an intuitive definition. A 
process is predictable when its date-t value is revealed at date t — 1 or, 
more generally, when X t is ^_i-measurable. It appears immediately that 
generalizing n X t is JF 4 _ 1 -measurable" to continuous-time is not natural and 
may be quite involved. 

To avoid the reader technical difficulties, we give a restricted (and then 
disputable) definition of predictability. It is sufficient for what we deal with 
in finance and is much more intuitive than the completely general definition. 

Definition 20 X is a predictable process if it is adapted to T and if its 

paths are left- continuous 4 . 

To give an intuition of the definition, consider a function / : R — >M. and 
suppose that you know all the values f(x) for x < x 0 . When is it sufficient 
to know f(xo)7 

Simply when / is left-continuous, because left-continuity means: 

hm f(x) = f(x 0 ) 

X<X(, 

The notation combining x — > x 0 and x < x 0 is sometimes denoted x — > 

x 0 . 

In the same way, left-continuous paths for a stochastic process mean that 
if you know the values at any date t < t 0 , you can infer the date-t value of the 
process. It corresponds pretty well to the intuitive notion of predictability. 

Definition 21 Let X be a process defined on (Q, A, P) ; 

1. The increments of X are independent if for any t\ < t% < ... < t n , 
the variables X t . — X t ._ x are independent. 

2. The increments of X are stationnary if for any t 6 T and h > 0 such 
that t + h G T, the probability distribution of X t+h — X t depends only 
on h (and not on t). 

4 To be completely rigorous, we should consider the tribe £>* on f2 x T generated by the 
processes with left-continuous paths. Predictable processes are then the ^-measurable 
processes (see DufHe, 1988, p!40). 
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Example 8 The most simple process satisfying definition 21 in discrete-time 
is a random walk X defined by: 



where the Y t are i.i.d. It is obvious in this case to check that increments 
are i.i.d because the increments of X are exactly the variables Y t . In a next 
section, we will see that the Brownian motion is the typical example of such 
a process in continuous-time. 

When X t is the logarithm of a stock price (whose process is denoted S), 
the difference X t — X s is the continuous return of the asset on the interval 
[s; t] . In fact, X t = ln(S t ) and then 



Assuming that X has independent increments is then linked to the efficient 
market hypothesis (EMH). If current prices reflect all past and present in- 
formation, Xt — X s doesn't depend on past returns. It is not useful to know 
past prices to infer the distribution of future prices, only the current price is 
important. 

The stationnarity hypothesis is often used (at least implicitly) in empirical 
studies, for example when the variance of returns is estimated using a time- 
series of daily returns. 



X 0 



X t -i + Y t 



0 
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A second important example of a process with independent increments is 
the Poisson process. 

Definition 22 A process X is a Poisson process with parameter X if: 

a) X 0 = 0 

b) X has independent increments 

c) For any pair (s, t) G M + x M. + with s < t, the variable X t — X s follows 
a Poisson distribution with parameter X(t — s). 

Vk e N, P(X t - X s = k) = (A(t ~ g)) exp (- (X(t - s))) (2.2) 

k\ 

It is worth to notice that the Poisson process is a continuous-time process 
but the state-space is discrete. The variables X t takes values in the set of 
positive integers N which is countable. 

These processes are commonly used in insurance to represent the arrival 
of damages; they are then linked to a process Y representing the amount of 
the claim. For example, at date t, the cumulated amount to be paid by the 
insurance company is written: 

Xt 

Claim t = s ^Y i (2.3) 

i=l 

where X t is the number of claims up to date t. 

The other field where Poisson processes arise naturally is the microstruc- 
ture of financial markets. The arrival of buy and sell orders on a stock market 
is often represented by such a process. 

Equation 2.2 means that the probability of an increment greater than one 
on a very short interval of length h is negligible with respect to h. In fact 

P(X t+h -X t = 2) = ^f- exp (-Xh) ~ y h 2 (2.4) 

At the same time, the probability of a unit increment on a short interval 
of length h is proportional to h and approximately equal to Xh. 

2.2.2 Markov and diffusion processes 

In this section, we introduce several categories of continuous-time stochastic 
processes, namely Markov, diffusion and ltd processes. They cover almost 
all the processes used in financial models. 
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Markov processes 

Definition 23 A process X defined on (ft, A, P, J 7 ) is a Markov process 

if for any (Bi, B n ) £ and any (ti, ...,t n ) G T n such thatti < .... < t n : 



P (X tn e B n \X t . e B,,j = l,..,n-l)=P (X tn e B n \X tn _, e B n _ ± ) 



The financial interpretation of this definition is the same as the one given 
for independent increments. Economic information carried over by X s for 
dates s < t n _i is the same as the one contained in X tn l .The reader can 
recognize definition 5 given in chapter 1 but with a different set of dates. 

Markov processes are sometimes called "no memory" processes because 
the path used to reach the set B n _i at t n _i has no influence on the probability 
distribution of the future variations X tn — Xt^. Only the state reached at 
date t n -i matters. 

When stock prices are Markov processes, strategies based on technical 
analysis are useless. There was, and there still is, an intense debate in the 
financial literature to know if stock prices are Markovian. On one side, propo- 
nents of the efficient markets hypothesis believe in the Markovian character 
of stock prices. One of their arguments is that you cannot become "rich" by 
trading on past information. On the other side, numerous empirical stud- 
ies show that some portfolios built on past information provide abnormal 
returns. To give a simple example, De Bondt and Thaler (1985) show that 
markets overreact. Buying past losers and short-selling past winners allow 
to get an abnormal return. 

Diffusion processes and Ito processes 

Definition 24 a) A diffusion process is a Markov process with continuous 
paths. 

b) A process X is an ltd process if X is a diffusion process and if there 
exist two functions a and a defined onRxT and taking values respectively 
in E and M. + defined by: 




lim 

/i-t-0 



E[X t+h -X t \X t = x] 
h 

V[X t+h -X t \X t = x] 
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To interpret \i and a think of X t as the logarithm of the date-i price of 
a stock. The increment X t+ h — X t is the return of the stock on the interval 
[t; t + h] . The function /i(x, t) is then the instantaneous expected return at 
date t when the process X is at level x. Dividing by h ensures that u is 
expressed per time-unit. The most common time-unit in financial models is 
the year. 

\x measures the direction of evolution of the process. It is generally called 
the drift. In the same way, a 2 is the instantaneous variance of returns 
per unit of time, a is called the diffusion coefficient of the process. In 
the financial literature, authors generally do not make a distinction between 
diffusion and Ito processes, the latter ones being almost always considered. 
As we will see later on, the dynamics of such processes may be written as a 
"stochastic differential" involving u and a. 

ltd processes are pretty well suited for describing prices, returns or interest 
rates. Nevertheless, assuming continuous paths is sometimes a restriction, as 
mentioned before. 



2.2.3 Martingales 

As in discrete-time, martingales play an important role in continuous-time 
valuation models. The principles and interpretations are quite close and the 
definition of continuous-time martingales is almost the same as the definition 
in discrete-time. 

Definition 25 Let (Q, A, J 7 , P) a filtered probability space; 

1. A {T ', P) -martingale is an integrable process X adapted to J 7 , satis- 
fying: 

V(s, t) e T 2 , s < t => E [X t \f s ] = X s (2.5) 

2. X is a (J 7 , P) -super-martingale if equality 2.5 is replaced by E [X t |JF S ] < 
X s . 

3. X is a (J 7 , P)-submartingale if equality 2.5 is replaced by E [X t |JF S ] > 
X s . 
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We simply write "martingale" instead of (J 7 , P)-martingale when no con- 
fusion is possible about the filtration or the probability measure. When X is 
a martingale, one can always assume that paths are right-continuous, that is 
X is assumed cadlag. Using the vocabulary describing the different possible 
paths, we could say that there exists a cadlag version of the martingale X. 

Some important results on discrete-time martingales are still valid in con- 
tinuous time. It is especially the case for the Doob decomposition. 
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Proposition 13 Let X be an integrable process adapted to T ; X can be 
decomposed as follows: 

VteT, X t = X 0 + M t + A t (2.6) 

where M is a martingale with M 0 = 0 and A is a predictable process such 
that Aq = 0. If X is a submartingale, A is increasing. 

The typical example of martingale in continuous-time is the Brownian 
motion presented in the next section. 

2.3 The Brownian motion 
2.3.1 Intuitive presentation 

The Brownian motion, also called the Wiener process, is surely the most 
commonly used stochastic process, in finance and in other sciences as well. 
It is for the set of stochastic processes what is the Gaussian distribution for 
the set of probability distributions. It is the reason why we devote a few 
pages to provide an intuitive feeling of what is a Brownian motion 5 . 

For pedagogical reasons, we start with discrete-time random walks. Re- 
member that a random walk is a stochastic process X defined by: 

X 0 = a 

where a is a constant and the Y n are assumed i.i.d. 

Let T be a given time horizon; [0; T] is divided in N sub-periods of length 
h = T/N and we look at what happens when N tends to infinity, T being 
fixed. The limit process (in a sense to be precised) will be a continuous-time 
process. 

Then, for the moment we assume T = {0, 1, N} . 

Proposition 14 Let X be a random walk with the Y n taking values a and 
—a with equal probabilities p = 1/2 

5 This section is largely based on Merton (1982). This paper was reedited in Merton's 
book (1990) entitled "Continuous-time Finance". 
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1. X has independent and stationary increments 

2. Cov (X n , X m ) = a 2 min(n, m) 

Proof. Point 1 is obvious by definition of the Y n . For point 2, suppose that 
m < n. We can write: 



Cov(X n ,X m ) = Cov(X m + ^2 Y k,X m ) 

k=m+l 

n 

= Cov(X m ,X m ) + 2j cov(Y k ,X m ) 

k=m+l 

n 

= V(X m )+ Cov(Y k ,X m ) 

k=m+l 

The second equality is obtained through the linearity of the covariance 
operator. The last equality comes from the equality Cov(X m , X m ) = V(X m ). 
Moreover the covariances Cov (Y k ,X m ) are equal to zero for k > m due to 
the independence of the Y k and X m = X 0 + ^2 k<m Yk- Finally, we use the 
assumption V(Y k ) = o~ 2 to obtain V(X m ) = Yl^i V(Xk) = ma 2 . 

Obviously, if n < m, we get Cov(X n , X m ) = V(X n ) = no 2 . ■ 

Suppose now that X n is the logarithm of the date-t price of a financial 
asset where t = nh, starting at Xq = 0 (in other words, the initial price of 
the asset is equal to 1). X n is then also the cumulated return of the asset 
on the interval [0; t] . The set T = {0, 1, N} is the set of transaction dates 
occurring in [0;T] 6 . 

Assuming N — > +oo is equivalent to suppose that the market approaches 
a continuous-time market. If the cumulated continuous return on [0; T] is 
denoted A T , we need A T = lim^r^+oo X N to get a consistent model. 

a is the standard deviation of returns on a given sub-period. It should 
depend on N because the cumulated variance is Na 2 on [0;T]. If T is 3 
months, weekly data corresponds to N = 13 and N = 91 corresponds to 
daily data. Obviously the daily volatility of returns is different from the 
corresponding weekly volatility. 

6 For example, T may be the maturity date of an option contract we try to evaluate. 
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So, the question is to link a to N in a consistent way. It is also the 
typical question you need to answer to calibrate the Cox-Ross-Rubinstein 
model (1979). When the period length changes, u and d must be changed. 

Let now (Y^,n = 1,...,N) denote the increments of the random walk 
when T = {0,1, N}. Denote cr^ the standard deviation of . The 
intuitive idea is to define the date-T logarithm of the price as Xn = X 0 + 

V^JV yN 

Note that for any N, Xn —X 0 defines the cumulated return of the asset 
on [0; T] , that is A T . Consequently when N tends to infinity and h tends to 
0, we should get a consistent description of the continuous-time process of 
returns. 

2.3.2 The assumptions 

To ensure the consistency of the approach, some precautions are needed. 
Careful assumptions are necessary about the evolution of parameters when 
N changes. We introduce hereafter the three "reasonable" assumptions pro- 
posed by Merton (1982). 

1. There exists A 1 > 0, independent of N, such that: 

Vn G T, V(X n ) > Ax 

2. There exists A 2 > 0, independent of N, such that: 

V{X N ) < A 2 

3. There exists A 3 > 0, independent of N, such that, for any n G T : 

V(Y n ) 



maxf =1 V(Yj) 



> A, 



Comments 



Assumption 1 says that uncertainty never disappears, even if the length 
of subperiods tends to 0. Consequently, the cumulated return between 
0 and t remains stochastic on continuous-time markets. 
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• Assumption 2 is, in some sense, symmetrical to the first one. It says 
that variance of returns on [0; T] doesn't explode when time is cut in 
finer and finer slices. This assumption illustrates what was already 
said in the Cox-Ross-Rubinstein framework. When the number of sub- 
periods increases, parameters u and d have to be changed. If they are 
held constant, the variance of returns would become unbounded. 

• Assumption 3 has a specific meaning. It stipulates that the variance 
of returns remains greater than a given proportion of the maximum 
variance. This assumption rejects cases in which uncertainty is concen- 
trated in some sub-periods. An example allows to understand why this 
assumption is reasonable to deal with risky assets. Assume you buy a 
lotto ticket on Tuesday when the official draw is on the next Saturday. 
Obviously, you buy a risky asset but it doesn't satisfy assumption 3. 
The problem is that nothing happens between Tuesday and Saturday. 
Consequently, the value of the ticket remains the same (no uncertainty 
about the return on intermediate periods). The value of the ticket 
changes only during the draw, depending on whether the numbers you 
chose show up in the official draw. In other words, the lotto ticket is 
locally risk-free! 

• An alternative justification (possibly more serious!) of assumption 3 is 
that on a continuous market, new information is generated in a con- 
tinuous flow, implying possible price variations at any moment. For 
the lotto ticket, no new information becomes available before the Sat- 
urday draw starts. We except the case where the sponsor of the game 
announces bankruptcy on Wednesday! 
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Remark that, if the process under consideration has stationnary incre- 
ments, assumption 3 is satisfied because the variance is constant on sub- 
periods of a given length. For the random walk considered at the beginning 
of the section we could choose A 3 = 1. 

This set of three assumptions, intuitive at the economic level, allows to 
specify which processes are good candidates to represent prices and returns 
of financial assets. We first recall the Landau notations O and o. 



Definition 26 Let f and g be two functions defined on R and taking values 
0. 



in R. We write f(h) = 0(g(h)) z/lim^ 0 4^ < +oo and f(h) = o{g{h)) if 



nm ^o m 



f{h) = 0(g(h)) means that the values of / and g have the same magnitude 
when h tends to 0. In the same way, f(h) = o(g(h)) means that / is infinitely 
small with respect to g when h tends to 0. 

The following proposition specifies the behaviour of the variance a\ when 
the length of the interval between two trading dates shrinks to 0. 

Proposition 15 

o\ = 0(h) and o\ ^ o(h) (2.7) 
where 0(h) and o(h) denote the Landau notations. 



V(X N ) = "£v(Y n N ) = Nal 



Proof. 

N 
n=l 

This equality is a direct consequence of the independence Y„ . Assumption 
2 implies Na\ < A 2 . Replacing N by ? leads to: 

It means that o\ = 0(h). 

In the same way, assumption 1 implies: 

Na\ > A x (2.9) 

or, equivalently, a\ > 4j^h. it shows that a\ ^ o(h). m 



59 



Download free ebooks at bookboon.com 



Stochastic Processes for Finance 



Continuous-time stochastic processes 



If the variance of Y n is 0(h), the values taken by Y n can be written 
as ah = c\fh and —c\fh with c a positive constant. In other words, the 
magnitude of the return is 0(\/h) on a period of length h. 

Consider now the special case c = 1; the process X may be written: 

X n = X n _i + S n \/h 

where the S n are zero- mean independent variables taking values +1 and — 1. 



3 




-4 



Figure 2.3: A random walk with h = 1 

Figure 2.3 represents a path with h = 1 and figure 2.4 a path with h = 0.2. 
The points for successive dates have been joined by straight lines. 

To understand how trajectories of the Brownian motion look like, it is 
enough to remark that the slope of the successive segment on the two figures 
is ±-^, or equivalently 

We observe that when h decreases, the slope increases. We advice the 
reader to simulate such paths for different values of h to observe the phenom- 
enon (on an Excel sheet for example) . When h tends to 0 the path looks like 
the price paths provided by any financial website or economic newspaper (or 
by figures 2.1 and 2.2 at the beginning of this chapter). 
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L.5 




•1.5 



Figure 2.4: A random walk with h = 0.2 

Define now At = limjv^ +00 Yln=i ^ n w here the S n are defined as 
before. The central limit theorem allows to conclude that A^ follows a Gaussian 
distribution with standard deviation \/T . Here, weak convergence is used. 

If two dates T and V are considered, with T < T', it is clear that At'—At 
is Gaussian with standard deviation \/T' — T and that At> — At is indepen- 
dent of At — A 0 . 

Following this intuitive presentation, we can now define more formally the 
Brownian motion which is the limit process of the random walks developed 
before when the duration between two transaction dates shrinks to 0. . 

2.3.3 Definition and general properties 

Definition 27 Let (Q,A,P) be a probability space; a standard Brownian 
motion (or Wiener process) is a stochastic process Z satisfying: 

1. Z 0 = 0 P.a.s 7 

2. The increments of Z are independent and stationnary 
'Remember that a.s means "almost surely", that is "with probability 1". 
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3. V(s, t) G M + x R+, s <t, Z t - Z s ~ JV(0, ^t^~s) 
4- Z has continuous paths. 

Without technical words, the idea behind the brownian motion is the 
same as the one behind the Gaussian distribution. When a phenomenon is 
the addition of a large number of independent components, the result should 
be driven by a Gaussian distribution. Moreover, combining points 1 and 3 of 
the definition shows that Z t is a zero-mean Gaussian variable with standard 
deviation y/t. 

In finance, price variations come from a continuous flow of information 
leading to demands and supplies of assets by investors. Therefore, if informa- 
tion is immediately reflected in prices (efficient market hypothesis) , successive 
returns should be independent. If it was not the case, investors would take 
into account this dependency... and it would be ruled out. 

The preceding section based on random walks illustrated that variations 
of Z between two dates t and t + h are of order \fh , infinitely large w.r.t 
h (see Landau notations). It explains why Brownian motion paths are very 
irregular. It is in fact impossible to draw a trajectory of a Brownian motion, 
because of the following proposition. 
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Proposition 16 Brownian motion paths are continuous but nowhere differ- 
entiate. 

Most properties of random walks can be transferred witout difficulties to 
the Brownian motion. 

Proposition 17 1. The Brownian motion Z is a Markov process. 

2. Z is a martingale with respect to its natural filtration. 

3. cov(Z t , Z s ) = min(t, s) 

Proof. Point (1) is a direct consequence of the independent increments. 

When you deal with Brownian motion, the independence of increments 
is used in most proofs. The idea is always to write something like Z t = 
Z s + [Z t — Z s ) (with s < t) and use the fact that Z s and (Z t — Z s ) are 
independent. Moreover, [Z t — Z s ) is also independent of T z , the natural 
filtration of Z. 

To prove point (2), let s and t denote two dates such that s < t. We can 
write: 

E\Z t \T z \ = E[Z s + (Z t -Z s )\F z ] 

= E[Z a \F?]+E[Z t -Z„\F?] 

The second term in the RHS is equal to 0 because Z t — Z s is independent 
of-^f: 

E [Z t - Z s \T Z \ =E[Z t - Z s \ = 0 (2.10) 

As Z is adapted to T z , Z s is T z - measurable. The properties of conditional 
expectations imply: 

E[Z S \T Z S \ =Z S (2.11) 
We finally get the martingale relationship 

E\Z t \T z \=Z s (2.12) 

To prove point (3), the same "trick" is used (assume that s < t) : 

cov(Z t , Z s ) = cov(Z t - Z s + Z s , Z s ) 

= cov(Z t - Z s , Z s ) + V(Z S ) 

= cov(Z t - Z s , Z s - Z Q ) + s 

= s 
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One more time, independent increments imply that the covariance be- 
tween Z t — Z s and Z s — Z Q is zero. 

Obviously, if t < s, Z s is decomposed in Z s — Z t + Z t and the same 
reasoning applies. ■ 



The definition of diffusion and ltd processes in section 2.2.2 introduced 
the following functions: 

/ + x v E[X t+h -X t \X t = x] 
fi{x, t) = lim 



h->o h 



2, , , V[X t+h -X t \X t = x] 



a (x, t) = lim 

It appears that the standard Wiener process is an ltd process with zero drift 
(/i(x,t) = 0) and a diffusion coefficient a(x,t) equal to 1. 



2.3.4 Usual transformations of the Wiener process 

Some functions of the Brownian motion are also martingales; the two most 
simple examples are summarized in the following proposition. 

Proposition 18 Let X and Y be defined by: 

X t = Z 2 t -t 

Yt = exp (^fZ t 

where 7 is a real number. X and Y are martingales w.r.t. T z . 

Proof. Z is square- integr able so X t is integrable; Z t is Gaussian so Y t is 
log-normal. The two processes are then integrable. 

To prove the martingale relationship, we use one more time the decompo- 
sition Z t = Z s + (Z t — Z s ) with s < t. The conditional expectation E [X t l-Tvf ) 
is transformed in the following way: 

E{X t \F?) = E{Zl\T z s )-t 

= E((Z t -Z s ) 2 + 2Z t Z s -Z 2 s \F z ) -t 

We use now: 
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1) the linearity of conditional expectations 

2) the fact that Z s is J-f— measurable 

3) the fact that Z is a martingale 

4) the independence of Z t — Z s w.r.t. T% . 
We then get: 

E{X t \T z s ) = E((Z t -Z s ) 2 \^)+2E(Z t Z s \^)-E(Z 2 \^)-t 
= E ((Z t - Z s ) 2 + 2E (Z t Z, -Z 2 s -t 
= E {{Z t - Z s f \T Z S ) + 2Z S E (Z t \T*) - Z 2 S - t 
= E((Z t -Z s ) 2 \^)+2Z 2 -Z 2 -t 

E ((Z t - Z s f) +Z 2 s -t 

(t-s) + Z 2 -t 
= Z 2 -s = X s 

We now show that Y is a martingale, using the same properties of condi- 
tional expectations. 



E(Y t \F?) = e X pf-^Wexp(7Z t )|.Ff) 

= exp f-^j E (exp ( 7 (Z t - Z s )) exp( 7 Z s ) \T Z S ) 

= exp (-^f) E ( ex P (l( Z t ~ Z s))) ex P(7^s) 

Remark now that exp (^(Z t — Z s )) follows a log- normal distribution with 

parameters 0 and 7\/i — s. Consequently E (exp ( 7 (Zj — Z s ))) = 7 
Replacing in the last equality leads to: 



E (Y t \T Z S ) = exp (V, -t^j=Y s 



and ends the proof. ■ 

The processes X and Y in proposition 18 are in fact specific transforma- 
tions of the Brownian motion by the following functions: 

(x,t) -> f(x,t) =x 2 -t 

, \ , \ ( 7 2 ^ 
(x, t) -> g(x, t) = exp I 7 x — 
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Let z(x,t) stands for the density of Z t defined by: 

1 fx 2 
z(x,t) = exp 



The functions / and g are solutions of the equation: 



h{0,s) 



h(u, t + s)z(u, t)du 



In fact, for /, we have /(0, s) = —s; the RHS is written: 

x> J r+oo / u 2\ 

f(u,t + s)z(u,t)du = , / (u 2 — (t + s)) exp I — — J du 

) ' y/2'Kt J-oo V 2t J 



'2ld 



u 2 exp ( — — ) du — (t + s) 



7= f^™ u 2 exp ^— is the variance of Z u equal to i. For g, we have: 



i + s)z(u, t)du 



1 



'27rt j -oo 

„2 



°° / 7 2 (* + *) « a ' , 
exp I 7w — I aw 



2t 



exp 



7 2 (t + s) 



'27rt 



it 



exp ( - // - — | du 



The integral is transformed by the usual method: 



exp I 7^ — 



u 
2t 



■ 1 , ^2 l 2 t 

exp ( -— 0-7*) + -y 



which leads to: 



i + s)z(w, t)du 



exp 
exp 



7 2 s 



7 2 s 



'2l\t J -OO 

--g(o,s) 



exp — 



— (u - -ft) 2 j (hi 
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2.3.5 The general Wiener process 

Using a standard Brownian motion as a model for stock returns has at least 
two serious drawbacks. The first one is that E(Z t ) = 0 for any t. Therefore, 
the expected stock returns are always 0. The second problem comes from 
the variance V(Z t ) = t meaning that a dotcom stock and an industrial firm 
(pharmaceutical, automobile, etc.) have the same variance. It does not 
correspond to empirical observations. 

Introducing the general Wiener process allows to solve these two difficul- 
ties by introducing a drift (which may be positive or negative) and a per 
time-unit variance (which may be lower or greater than 1). The natural ex- 
tension of the standard Brownian motion introduces parameters fi and a, [i 
being the drift, and a 2 the instantaneous variance per time-unit. 

Definition 28 A process W is a general Brownian motion with parameters 
/i and a if W t writes: 

W 0 = 0 

W t = {it + aZ t 

where Z is a standard Brownian motion. 
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Properties of W are easily deduced from those of Z. 

Proposition 19 1. W is a process with independent and stationnary in- 
crements 

2. For s <t,W t -W s ~ 7V(/i(t - s), a^Jt - s) 

3. cov(W t ,W s ) = a 2 (t-s) 

4- W is a sub (super) -martingale w.r.t. its natural filtration if fi> (<)0. 
5. The paths of W are (a.s) continuous and nowhere differentiable. 

Think of a stock with an initial price of 1. If W t is the cumulated return 
of this stock on [0; t] , /i > 0 means that the expected return is positive. If 
there is a risk-free asset traded on the market with a non stochastic return 
r, risk aversion of agents implies generally that fi > r. Investors require a 
premium to invest in a risky stock. 

Remark 2 1) Let h be a real number close to 0; the variation of W on 
[t; t + h] is written as: 

W t+ h -W t = fih + cr(Z t+h - Z t ) 

When building Z, starting with random walks, we observed that Z t +h — Z t 
is 0(y/h), that is infinitely large w.r.t. h when h tends to 0. It is also true for 
W and the economic consequences are important. Even if the expected return 
fi is different from zero, it is not possible to predict short-term variations. 
The term o~(Z t+ h — Z t ) is much larger (in absolute value) than fih when h 
is close to 0. It is the reason why observing very short term variations in 
prices (intraday variations for example) says nothing about the evolution on 
a longer horizon (yearly variations for example). 

2) One has to be careful with intuitions about the Brownian motion. For 
example, assume \i = 0, corresponding to a process without any drift. It is 
natural to think that paths will stay "around" zero. However, we will show 
that paths of such a Brownian motion cross any boundary ( starting from 0, 
the value goes outside any interval [a;b]) in a finite time). Consequently, if 
you observe just one path, you could conclude that there is a drift when it is 
not the case. 
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Figure 2.5: Simulation of a standard and a general Brownian motions (fi = 
10% and a = 20%) 

Remark 3 Figure 2.5 shows an example. We simulated a path of a standard 
Brownian motion on one year (the time-unit) at a daily frequency. It corre- 
sponds to the thin blue line. The bold red line is the corresponding general 
Wiener process with parameters [i = 10% and a = 20%, that are reasonable 
values for a stock return. Looking at the path on the 9 first months (up to 
t = 0.75), the curve shows a "strong positive drift" but in fact the simulation 
was obtained with a zero drift. 

Before proving the abovementioned "crossing boundary" result, it is nec- 
essary to introduce stopping times in continuous-time. 

2.3.6 Stopping times 

Definition 29 Let (Q, A, J 7 , P) be a filtered probability space; a stopping 
time is a random variable r taking values inT\ \ {+00} such that {r < t} e 
T t for any t G T. 

This definition is close to the one given in discrete-time. The essential 
difference is that it uses events {r < t] instead of {r = t] . Time being con- 
tinuous, it could happen that P({t = t}) = 0 for any t. 
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As mentioned before, stopping times come naturally in the valuation of 
American options for which early exercise is possible. The exercise decision 
must be based only on past and present information, it cannot be based on 
future prices which are not yet known. 

The value +00 has been added to the set of possible values of a stopping 
time. It is used to take into account that, sometimes, it is never optimal to 
exercise the option, even at the maturity date. In that case, it is conventional 
to write r = +00. 

Consider now a corporate bond with maturity T paying coupons (C±, Ct). 
This bond may default at date t meaning that it really pays F t < C t . The 
first default date is the stopping-time r defined as r = inf {t / F t < C t }. We 
write r = +00 if default never occurs before the maturity date T. 

Proposition 20 Let r and r' be two stopping times w.r.t. a filtration T . 
r + t', min(r, r'), max(r, r') are stopping times. 

A common alternative notation for min(r, r') is r A r' and r V r' for 
max(r, r'). 

Stopping times like r A r' and rVr' are frequently encountered in life 
insurance policies. If r and r' are the dates of death of two married people, 
some contracts stipulate that the insurance company will pay a given amount 
to the survivor at the first death date (min(r, r')) , some other contracts 
paying to the heirs at the last death date (max(r,r / )). A usual joke is to 
remark that \r — r'\ takes low values when the spouse dies first, but takes 
very large values when the husband dies first. 

More seriously, it must be noted that a sure date t is a stopping-time. 
Consequently, t A r is also a stopping time. 

A tribe T r is linked to any stopping time r. It contains all the sets 
^4 Pi {r < t} with A e A. We then get the following proposition. 

Proposition 21 Let t be a stopping time and X a stochastic process. 

1. t is F T -measurable 

2. If t is bounded and X adapted to T , then X r is T T -measurable 

3. If r and r' are two stopping times such that r < r', then T r C T r > 
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2.3.7 Properties of the Brownian motion paths 

We come now to the "crossing-boundary" property. 

Proposition 22 Let W be a Wiener process with parameters (0, a); let a 
and b denote two real numbers such that a < 0 < b and r denote the stopping 
time defined by: 

t = inf {t e R + I W t = a or W t = b} 

We then get: 

P (W T = a) 
E(t) 



b — a 
—ab 



a 2 

Proof, t is the first date at which the process reaches either a or b. First, 
remark that the probability of reaching a at date r (instead of b) doesn't 
depend on a, because at date r the process is in a or b. It is then intuitive 
that a only plays a role on the delay to reach one of the bounds but not on 
the probability of reaching one of these bounds. It follows that P (W T = a) = 
P(Z T = a). 

Consider X, the martingale defined in proposition 18 : 

X t = Z 2 t -t 



Let r* = r A s with s a positive real number; the stopped process 

E{Xf)=E(Xf) = 0 



XI satisfies: 



from which we deduce: 

\2 



E(Zl T «)=E(tAT*)<(b- 



a 



In fact, the definition of r implies that before this date, the process Z 
has reached neither a nor b. But this relationship is true for any s, implying: 

E{t) = lim E(t*) <(b- a) 2 

s— >+oo 

r is then a bounded stopping time, implying E(Z T ) = E(Z 0 ) = 0 by the 
optional stopping theorem. We can then write: 

E(Z T ) = aP(Z T = a) + 6(1 - P(Z T = a)) = 0 
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which leads to: 



P(Z T 



or 



The same reasoning applied to X allows to write: 

e(x;) = o = e(z 2 t -t) 

E{Zl) - 



b — a b — a 
As Wt = <rZ t , we then get the desired result. 



-ah 



This result shows that Z crosses one of the boundaries in a finite expected 
time. It also illustrates why interest rates cannot be easily represented by a 
Brownian motion. First, they could take negative values, and second they 
would go higher than any given threshold in a finite expected time... even if 
the threshold is 100% or 200%. The intuitive idea we can have about the 
dynamics of interest rates is that the drift should be negative when interest 
rates are high and positive when they are near zero. We then need more 
general diffusion processes than the Brownian motion to adequately modelize 
many economic variables. These processes are presented in the next chapter. 
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Chapter 3 

Stochastic integral and Ito's 
lemma 

3.1 Introduction 

In the preceding chapter we presented the elementary properties of the Wiener 
process. The way we built the Brownian motion was based on random walks 
with zero- mean binary increments Y n . Then we deduced a formulation of the 
general Wiener process by allowing non zero-mean increments and a vari- 
ance proportional to the delay between two dates. However, the parameters 
(expectation and variance per time unit) were constant. In particular they 
depended neither on the date nor on the value already reached by the sto- 
chastic process. It may be a restrictive assumption, even for relatively simple 
economic processes like interest rates or stock prices. 

Assume for example that we want to model the stochastic process driving 
the evolution of a stock price. Using a Wiener process generates several prob- 
lems but we will focus on just one for the moment. Think to the economing 
meaning of a constant expected change per time unit. It means for example 
that a price will have a yearly expected change of $10, whatever the current 
price is. Obviously a $10 change is not the same when the stock price is $20 
or when it is $200. Therefore, the Brownian motion is not a good candidate 
to model stock prices, even if it seems adequate for returns (except during 
financial crises!!!). 

The Brownian motion assumptions are then restrictive when one wants to 
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describe specific economic variables. As mentioned at the end of the preced- 
ing chapter, real data suggest that the probability of an increase (decrease) 
in interest rates is lower when rates are high (low) . A mean-reverting process 
comes naturally to mind for describing such phenomena. In the same way, 
it seems intuitive that the volatility of interest rates is higher (lower) when 
rates are high (low). Figure 3.1 illustrates the point. It represents the so- 
called U.S Fed rates on a period of 55 years. It seems reasonable to consider 
a stochastic process with non constant volatility and a varying drift. For 
example, the Fed rate felt to 0 at the end of the period under consideration 
due to the financial crisis. The drift cannot be negative after that date. 



Federal Funds Rate (effective) 
July 1954 to December 2008 

I I ' 1 ' ' ' I I I I 




1952 1958 1964 1970 1976 1982 1988 1994 2000 2006 2012 
Data 



Figure 3.1: Interest rates for Federal Funds (Source: Federal Reserve, 
http:/ /www.federalreserve.gov) 

The definition of ltd processes assumes there exist two functions fi(x, t) 
and a(x,t) depending on time and on the level reached by the process under 
consideration. These processes are good candidates to take into account the 
abovementioned problems (however, keep in mind that the Brownian motion 
is a special case of ltd process with constant parameters) . 

In this chapter we are going to "build" ltd processes using the concept 
of stochastic integral. We will then present Itd's lemma (also called Funda- 
mental theorem of stochastic calculus) which is the mathematical tool used 
to price derivative securities in continuous-time. The price of such securities 
can be written as a regular function of the price of an underlying asset. Itd's 
lemma helps to find the dynamics of prices of derivative securities, being 
given the dynamics of the price of the underlying asset. 
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Finally, we introduce the Girsanov theorem which allows to change the 
drift of an ltd process without changing its variance. In arbitrage pricing 
models, this is an important question because derivative contracts are valued 
under the assumption of no arbitrage. The valuation technique consists in 
writing the dynamics of prices in a risk-neutral world in which the expected 
returns are equal to the risk-free rate (which is different from the physical 
rate 1 ). 

3.2 The stochastic integral 
3.2.1 An intuitive approach 

As mentioned before, a brownian motion assumption is not a good way to 
describe the random evolution of stock prices, even if it may be a good idea 
to model the logarithm of stock prices. In the following, we continue to refer 
to the logarithm of a stock price but the stochastic integral we are going to 
build is a much more general approach than the one used before. 

To keep things simple, we nevertheless adopt the same pedagogical ap- 
proach as in the preceding chapter by starting with random walks. Let a 
time- interval [0; T] be given, and assume that it is divided in N sub-periods 
of equal length h with h = T/N. For example, 0, h, 2h, Nh are the dates 
at which the market is open and trades occur. 

Let X stands for the stochastic process of the logarithm of a stock price 
defined by: 

X 0 = 0 

N n X n _\ -\- Y n 

for n G T = {1,...,N}. This formulation means that X n (oS) is the value 
of the process X at date nh in state uj since h is the duration between two 
successive trading dates. Y n is still interpreted as the stock return on a period 
of length h. 

1 We do not use the term "real" rate because it usually refers to rates of return after 
having taken into account inflation. Here, "physical" refers to the expected return in the 
real world where agents are risk-averse. 
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The definition of the random walk Y n is now a little bit more involved: 

Yn = Vn-lh + &n-l (Z n h ~ Z(n-l)h) (3-1) 

where Z is a standard brownian motion. It is assumed that [i n _i and cr n _i are 
-^(n-ij/i-measurable random variables where T is the natural filtration of Z. 
This formulation is very general and, for the moment, no other assumptions 
are made on // n _ l5 er n -i- 

/i n _i is the per time- unit conditional expectation of the increment of X 
on the time-interval [(n — l)h; nh] and cr n _i is the corresponding conditional 
standard deviation. Conditional moments are defined here with respect to the 
a— algebra J-( n -i)h- It al so means that cr n _i and Z nh — Z( n _i) h are independent 
and V (Z nh - Z^-Dh) = h. 

In the preceding chapter, we described Merton's assumptions (1982) about 
the processes X and Y and we still assume they are satisfied. 

Let At the cumulated return of the stock on [0; T]; we want to write 
as the limit (in a sense to be defined) of Xn when N tends to infinity. We 
then have: 

N N N 

fffi-l (Z n h — Z{n-\)h) 

n=l n=l n=l 

Replacing h by we get: 

N / N \ N 

X N = /^2 Y n = T Vn-l I + (Tn ~ 1 ( Znh ~ Z {n-l)h) 

71 = 1 \ 71=1 / 71=1 

= A N + B N 

With A N = T(± J2n=l Vn-l) and B N = (En=l °"n-l (Znh ~ ^(n-l)h)) 

To get a consistent continuous-time model, we need to define precisely 
A = lim^+oo A N and 5 = limjv->+oo B N . 

Let a continuous-time process defined by: 

a s = A*n-i if s ^ [( n - i)^; 

For a given state of nature, ct^ (u) is then constant on each interval 
[(n — l)h; nh[ . In other words, a N (u) is then a step function. 
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For any cj, we can write: 

A N {u) 



(oj)ds 



where the integral is a usual Riemann-Stieltjes integral. When N — > +00, 
we can define, for each state u, A(uS) as: 



a s {uj)ds 



where a s (u) 
state". 



lim 



a.g (00). Here, the limit process a is defined "state-by 



Unfortunately, we cannot use the same simple approach to define the limit 
of Bn, simply because the values taken by the random variables (Z n h — Z( n _i)/i) 
are of order \fh. 

If a and Z were usual functions and not stochastic processes, we would 
get the following formulation: 



lim Bj^ 



cr,,dZ. 



by using a standard Stieltjes integral. However, this approach does not work, 
as shown in the following example. 
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3.2.2 Counter-example 

Let Cat and Dn defined by : 

N 



Cn = Z nh [Z nh — Z( n _i)/j) 

n=l 

JV 

Dn = /] Z(n-l)h {Z n h ~ -^(n-l)fe) 



n=l 

with Z a standard Brownian motion. If Z was not a random process but an 
integrable function, we would get: 

t z 2 — z 2 z 2 

lim C N = lim D N = I Z.dZ, = — = — 



/ j\l = urn un = / Zj s azj s 



But we can easily show that the expectations of Cat and D N are different 
and do not depend on N. One more time, the "trick" Z t = Z s + (Z t — Z s ) is 
useful. 



A? 



E(C N ) = E \^2i ^ nh {^ nh ~ Z{n-i)h) j (3.2) 

N 

= ^ E {Znh {Z n h — Z( n -i)h)) (3-3) 
n=l 

Using Z nh = Z(n-\)h + {Znh ~ Z( n -i)h), we get: 

N 

E{Cn) = /]E [{Z(n-l)h + {Z n h — Z( n -j)h)) {Z n h ~ -^(n-l)fe)] (3.4) 
n=l 

/ N \ 

= E I ^Z(„_i)/,, — Z(n-\)h) ) (3.5) 

+^^(^-^(n-l)/,) 2 j (3.6) 

N N 
= y^ y E ^{Z n h — Z(n-\)h) J + /^E \Z(n-\)h {Z n h ~ Z( n -l)h§\ -7) 



n=l n=l 
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We use now the properties of the Brownian motion. As Z nh — Z< n -\)h is 



zero-mean, we can write: 



E y{Z nh - Z (n _i )h ) 2 ) = V (Z nh - Z(n-i)h) = nh - (n - l)h = h 

After summation, the first term in expression (3.4) is equal to Nh = T. 

To evaluate the second term, we use the independence of increments of 
the Brownian motion. 

N N 

/ y E (Z(n-i)h (Z n h — = ^ E (Z{n-l)h) E [Z nh — Z( n _!)/{}3.8) 

n=l n=l 



We then obtain the following result: 



E(C N ) = T 

It is important to notice that it does not depend on the way the interval 
[0; T] is sliced. Equivalent ly, it does not depend on N. 

Remark now that E(Dn) has already been calculated in equation (3.8) 
and it is equal to 0. Consequently, the two sequences Cn and Dn cannot 
converge to the same limit since they have different expectations. 

This counter-example is interesting because it shows that we can define 
at least two stochastic integrals (in fact an infinity). 

In the following, we choose the one based on D^. This choice can be 
easily understood if one refers to the way the gain of a strategy is calculated. 
Remember that it was defined in discrete-time as: 

n 
s=l 



where the coefficients 9 n h denote quantities (assumed JF ( - n _ 1 ) fe -measurable be- 
cause 9 is a predictable process). 

It is also the reason why <t„_i was used in the definition of Y n (see equation 
(3.1)) and assumed ^ r ( n _i)/ l -measurable. 

To get a square-integrable limit, we need to assume that B N is square- 
integrables. 
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E (B N ) may be written as: 

E {B%) = E {(Bn — Bn-i + Pw-i) 2 ) 

= E ((B N - Piv-i) 2 ) + E (B^) + 2E (Bjv_i (B N - 

= £" (o 2 N _i (Z Nh — Z(N-i)h) j + E (B 2 N _^j + 2E (.Bjv-i) £" (5jv — B N -i) 

= hE (o-l_ x ) + E (Bl_ x ) 

The last term in the RHS is zero because of the independence of incre- 
ments of B. Moreover, {Znh — <^(jv-i)/i) is independent of o"^ r _ 1 and £" {[Znh — ^(jv-i)/i) 
/i. We then get, by a recurrence argument: 



Remark now that if P N is defined on [0; T] by: 

= &n-i f° r s G [(n — l)/t; n/i[ 

we finally get: 

E (B%) = E W <k) 

The sequence @ converges to a process which is (naturally) denoted a. 
The latter equality shows that E ( J Q T (o~ s ) 2 <isj < +00 is a sufficient condition 
for Bj\j to be square-integrable. 

We can now define more formally the stochastic integral of a stochastic 
process w.r.t. the Brownian motion. 



3.2.3 Definition and properties of the stochastic inte- 
gral 

Definition 30 Let Z be a standard Brownian motion defined on (fl, A, T z , P) 
and a a process adapted to T z . Suppose that a satisfies: 

E (j o* a dsj < +00 (3.9) 
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The ltd stochastic integral of a on [0; T] w.r.t. Z, denoted as J Q T a s dZ s , 
is the random variable defined by: 

,t n 

/ a s dZ s = lim V" (Z nh - Zr n -i)h) 

JU n=l 

where the limit refers to convergence in quadratic mean. 

It is important to remark that the index n in the RHS of the preceding 
equation is discrete and corresponds to the subdivision of [0; T] in N sub- 
periods. An equivalent alternative formulation would be to divide [0; T] in 
sub-intervals [tf, t i+ i[ such that t^ = T and to write the limit as: 

/ a s dZ s = Jim <Tti-i ( z u ~ z u-i) 
Jo ~* 00 i=l 

with max(£j — ti-i) converging to 0 when N — > +oo. Moreover, this formula- 
tion is valid for any horizon up to T. Consequently, the family (J Q U a s dZ s , u G Tj 
of random variables is a stochastic process sometimes written as (I u (a),u G T) . 

It is also worth noting that if a is predictable, cr n _i can be replaced by 
a n in definition 30. For example, for a stochastic process 6, representing 
quantities of stocks in a portfolio, the integral J Q 9 s dZ s is the gain of the 
strategy if the price process is Z. 
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Proposition 23 Let X and Y be two adapted processes such that j Q X s dZ s 
and f*Y s dZ s exist for any t < T and are square-integrable (see definition 
30). We then get: 

1. V(o, b) G M 2 , J*(aX s + bY s )dZ s = a J* X s dZ s + b /„* bY s dZ s 

2. V(t, u)eTxT,t<u, £ X s dZ s = X s dZ s + £ X s dZ s 

3. V{t,u) eTxT,t<u, E (£X s dZ s \F t ) = J*X s dZ s 

4-VteT, E ( (/ 0 * x s dz s }) = E X 2 s ds) 

Point (1) means that the mapping associating a process X to its stochastic 
integral is linear. Point (2) is a standard property of integrals. Point (3) is 
crucial for financial applications because it shows that (J 0 " X s dZ s , u e T) is 
a martingale. Finally, the last point stipulates that J^X s dZ s belongs to L 2 . 

Recall that building the stochastic integral in this way aims at looking 
for a continuous-time limit for the process X with 

X 0 = 0 

X n = X n -x + Y n = X n _i + fi n _ih + cr n _i {Z n h — Z( n _i)/j) 

Consequently, the limit process (still denoted X to save some notations) 
is formulated as: 

X 4 = X 0 + / /i s ds+ / (3.10) 
■/o Jo 

When the stochastic integral can be defined, X is an ltd process as intro- 
duced in the preceding chapter. The simplified notations fi s = fi(X s , s) and 
a s = cr(X s , s) leads to write equation 3.10 in the general form as a stochastic 
differential: 

dX t = ii(X t , t)dt + a(X t , t)dZ t 

Remark 4 It is important to keep in mind that it is only a way to translate 
that X is written as a stochastic integral. Obviously, dZ t is not a usual 
differential because the paths of the Brownian motion, though continuous, 
are nowhere differentiable. 
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If the process X is the general Brownian motion W with parameters ji 
and a (which are some constants), we write: 



We come back to the initial formulation by writing (remember that Wq = 0): 



3.2.4 Calculation rules 

The following proposition summarizes the calculation rules applied to incre- 
ments of the Brownian motion (equivalently dZ t ). To avoid technical diffi- 
culties, the reader can think of dZ t as a difference Z t +dt — Z t where dt is an 
infinitesimal time-period 2 . The general intuition is that the higher order (>1) 
terms in dt may be neglected in developments because, on markets working 
in continuous-time, the higher order terms are infinitely small w.r.t. dt. 



Proposition 24 1. E(dZ t ) = 0 et V(dZ t ) = dt 

2. V {dZ 2 t ) = o(dt) 

3. dZ t .dt = o(dt) 

I E (dZ h dZ t2 ) = 0 for h ^ t 2 

5. If Z and Z* are two Wiener processes, we get: 



Point (1) is obvious when dZ t is identified to Z t+ d t — Z t since the standard 
Brownian motion is zero- mean and V(Z t — Z s ) = t — s for t > s. 

2 This interpretation is clearly disputable but it allows an intuitive presentation of the 
results. 



dW t = fidt + adZ t 




s 



fit + aZ t 



e (dz t dz;) 

V (dZ t dZ*) 



r t dt 
o(dt) 
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Point (2) says that (dZ t ) 2 is not random or, more precisely, has a negligible 
variance. Remember the discrete-time approach with random walks. The 
increment of Z was \fd~t or —\fd~b with equal probabilities. Therefore (dZ t ) 2 
is a constant equal to dt. 

For point (3), dZ t = 0(\/dt) implies dZ t .dt = O(dt^) = o(dt). 

As soon as ti ^ t 2 , dZ tl and dZ t2 are independent because the increments 
of a Brownian motion are independent. It proves point (4). 

Finally, r t is the correlation coefficient between increments of two Brown- 
ian motions, each being 0(\fd~t). The product is then O(dt). In the same way, 
the variance of the product is 0(dt 2 ), that is o(dt). 

These rules are useful to understand Ito's lemma presented hereafter. 
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3.3 Ito's lemma 

In many valuation models, especially for derivative securities, one has to look 
for the dynamics of a regular function of an ltd process. The most well-known 
example is the valuation of a European call option with maturity T and strike 
price K. The final payoff of such an option is written: 

C T = max (X T - K; 0) . (3.11) 

A valuation model needs the determination of the dynamics of the option 
price process C = (C t ,t e T) over its lifetime. The initial value Co is espe- 
cially important for financial purposes. We assume that C t = g(X t ,t) where 
g is a function defined on K + x T taking values in K + . If X is an ltd process, 
the problem is to characterize the process C. 

The same kind of problem arises when one wants to express prices (re- 
turns) starting from returns (prices) . If X is an ltd process representing the 
cumulated continuous return of a financial security, what is the dynamics of 
the corresponding price process Y t = exp(X t )? Obviously, the question can 
be asked the other way. If the dynamics of the price process Y is given, what 
is the dynamics of the continuous rate of return X t = ln(Y^)? 

An other example is the dynamics of bond prices as functions of the 
interest rate process. The problem is more complex because a bond price is 
determined by the term structure of interest rates, not by only one interest 
rate. To keep things very general, the bond price dynamics is a function of 
a family of processes driving interest rates. The models used in the financial 
literature are a simplified version of this complex reality. Some models are 
based on the assumption that bond prices only depend on the dynamics of 
the short rate, some others assume that they depend on the short rate and 
a long-term rate 3 . These simplifying assumptions are generally justified for 
practical implementation reasons. 

The mathematical tool allowing to answer the preceding questions is the 
so-called Ito's lemma which is a kind of Taylor's series expansion for ltd 
processes. 

3 See for example Vasicek (1977), Cox, Ingersoll and Ross (1985), Longstaff and 
Schwartz (1992), Heath, Jarrow and Morton (1992) and Brace, Gatarek and Musiela 
(1997). 
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3.3.1 Taylor's formula, an intuitive approach to Ito's 
lemma 

Let / denote a twice differentiable function denned from M 2 to M; the Taylor's 
series expansion of / at (xo,to) is written as: 

df df 
f[x,t) = f(x 0 ,t 0 ) + —(xo,to)(x-xo) + —(x 0 ,t 0 )(t-to) (3.12) 



ld 2 f 1 d 2 f 

+2 ( x o> *o) ( X ~ X °} 2 + 2W ^ ^ ~ 
d 2 f 

+ dxdt^ X °' * 0 ' ^ X ~ Xq ' ^ ~~ ^ +£ ( x °> *°) 



where e(xo,t 0 ) ~ o ((x — xq) 2 + (t — to) 2 ) . The condition on e shows that 
third-order terms are negligible w.r.t. first and second-order terms. In most 
cases, a first-order development is used in economic applications; it corre- 
sponds to the first line of the equation (3.12). 



The story is a little bit different when x is an ltd process which is written 
as a stochastic integral w.r.t. a Brownian motion. The variation of this 
process on a time-interval of length t — t 0 is of order y/t — to. Consequently, 
the second-order term ^(xo,to){x — xq) 2 cannot be neglected because it is 
0(t — to). It has the same magnitude as %(xq, to) (t — to). 



Let us now denote df(xo, to) = f(x, t)—f(xo, to) ; t— to = dt and x—xo 
dx; equation 3.12 becomes : 



df df 1 d 2 f 

df(x 0 ,t 0 ) = —(xo,to)dx + —(xo,to)dt+- — (x 0 ,to)(dx) 2 (3.13) 



Id f o f 

+ ^-r^{.Xo ) to)(dt) 2 + -—^(x 0 ,t 0 )dxdt + £(x 0 ,t 0 ) 

Replace x by X t with X a stochastic process characterized by: 



dX t = fi(X u t)dt + a{X t , t)dZ t 



Equation 3.13 (giving up the arguments of partial derivatives to simplify 
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notations) becomes, for (X t ,t) : 

df df 
df(X t ,t) = ^[ f x(X t ,t)dt + a(X t ,t)dZ t ] + -^dt 

HX t ,t)dt + a(X t ,t)dZ t ] 2 
+ o T^-(^) 2 + tH^ M x ut)dt + °{ x u t)dZ t ) dt + e(X t , t) 



2 dt 2 v ' dxdt 

Applying now the calculation rules defined in proposition 24 allows to see 
£1 . IIK ] 1EL 

dt 2 Alm dxdt 



that the coefficients of and ^-4r are negigible (they are o(dt)). It then 



follows: 

df df 
df(X t ,t) = ^\fi(Xt,t)dt + a(X t ,t)dZ t ] + -±dt 

+ \ £ [yu(x *' t)dt + cr(x *' t)dz ^ + £ '( Xt,t) 

with e' = o(dt). 

When developing the term \p(X t , t)dt + a(X t , t)dZ t ] 2 , one negligible term 
(0(dt 2 )) appears, one term dZ t dt is also negligible (because it is 0(dt 2 )) and, 
finally, one term dZf which is 0(dt) and then not negligible. After all possible 
simplifications, it remains: 

df(X t ,t) = ^(ii(X t ,t)dt + a(X t ,t)dZ t ) + ^dt 
+ y^a 2 {X t) t)dt + e"{X u t) 

with e" = o(dt). 

We can now write the process f(X t , t) as a stochastic differential by group- 
ing dt terms on one hand and dZ t terms on the other hand: 

df..,„ ^ , df , ld 2 f _ 2iv ^\ JAi _ /v ^df 



df{Xut) = \t KXut) + ft + 2d^ a {Xut) J dt+a(X t ,t)^dZ t +e"(X u t) 

This brief presentation gives the intuition of the result but to prove rig- 
orously Ito's lemma needs more precautions. Especially, the s(X t ,t) are 
stochastic and saying "e is negligible w.r.t. dt" is not sufficiently precise. 
Readers can find a complete demonstration of Ito's lemma in specialized 
books 4 . 



'For example Karatzas-Shreve (2000), chapter 3. 
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3.3.2 Ito's lemma 

We can now write more formally Ito's lemma which is nothing else than a 
Taylor formula in a specific stochastic environment. 

Proposition 25 Let X be an ltd process characterized by 

dX t = ii{X t , t)dt + a(X t , t)dZ t (3.14) 

and let f : M. 2 — > K a function with continuous partial derivatives up to order 
2. The process Y defined by Y t = f(X t ,t) is an ltd process with a stochastic 
differential given by: 

ay > = + I + \U° 2 ^) dt + ^fj z > 

If we write the stochastic differential of Y in the following form: 

dY t = n Y (Ft, t) dt + a Y (Y t , t) dZ t 

we get: 

df 

<j Y (Y u t) = <r(X t ,t)^ 

fi Y (Y t ,t) is the drift of the process f(X t ,t) and ay (Y t ,t) is the corre- 
sponding diffusion coefficient. 



3.3.3 Applications 

From return to price 

Let W denote a Wiener process with parameters \x and o (which are constant 
for W) and Y the process defined by Y t = f(W t ) = exp (W t ) . Y is then the 
transformation of the Brownian motion by the exponential function. Remark 
that t does not enter the transformation implying |£ = 0. The dynamics of 
Y is obtained by applying Ito's lemma. 

PrOte) = |. + f + ^ 2 = -P(^)(.+ y)=^(.+ y) 
a Y (Y t ,t) = o%=aY t 
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or, equivalently: 



dY t ( a 2 . , 
-y- = ( A* + y ) dt + adZ t 



Y (which may represent the price of a stock) is called a geometric 
Brownian motion. 

From price to return 



Symetrically, let Y denote a price process characterized by : 

Y 0 = 1 
dY t = fj,Y t dt + aY t dZ t 

Let X be defined as X t = g(Y t ) = ln(Y^). In this case, we get: 

a 2 

° Y *JT = * 
ox 

These equalities lead to: 



a 2 



dX t = — — j dt + adZ t 

Interest rates and the Ornstein-Uhlenbeck process 

An Ornstein-Uhlenbeck process is a process X with the following stochastic 
differential: 

dX t = ot{fi - X t )dt + adZ t 

This stochastic process is often used to describe the dynamics of short- 
term rates 6 . We observe that the drift is positive when the short rate X t is 



5 This process is called "geometric Brownian motion" . It is the usual process to modelize 
stock prices, especially in the option valuation model of Black and Scholes (1973). 
6 Vasicek (1977) was the first to propose this process for short term rates. 
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low (lower than (3) and the short rate tends to rise in the short run. On the 
contrary, when X t is high, the short rate tends to decrease. So this process 
is said "mean-reverting". The short rate evolves around its long term mean 7 
(3. The coefficient a measures the strength of mean reversion. 

In discrete-time, we encountered such a process when we described ran- 
dom draws in an urn (with white and black balls) without replacement. 

Interest rates and the square root process 

In the Ornstein Uhlenbeck process, the diffusion coefficient a doesn't depend 
on X t . As variations of Z during an interval of length dt are of order y/dt, 
this process can take negative values because, on the short run, the variation 
of Z is much larger than the variation coming from the drift. 

To solve this problem, Richard (1978) and then Cox-Ingersoll-Ross (1985) 
proposed the "square root process" defined as follows: 

dX t = a((3- X t )dt + o^X t dZ t 

We observe that the variance of the short rate variations is proportional 
to the level of the short rate. Especially, if the rate reaches 0, only the drift 
term a(3dt remains and is positive. Consequently, this process cannot take 
negative values. 

' Warning: f3 is the long-term mean of the short rate, not a long-term rate! 
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3.4 The Girsanov theorem 
3.4.1 Preliminaries 

Let W denote a Brownian motion with parameters (fi, a) , defined on a fil- 
tered probability space (Q, A, J 7 , P) where T is the natural filtration of W. 
It is also the natural filtration of Z, since W t = fit + aZ t . Assume that 
W t = ln(iS't) where S t is a stock price. W t — W s is then the logarithmic return 
of the stock on the time period [s; t] . Denote r the risk-free rate (assumed 
constant). The no arbitrage assumption implies there exists a risk- neutral 
probability measure Q under which the drift of W is equal to r. As we deal 

with two probability measures P and Q in this section, we denote Ep and 
Eq the expectation with respect to P and Q. 

Girsanov theorem is the technical tool that allows to transform a process 
with drift fi in a process with drift r ^ fi. 

In order to introduce this result, we first present the method which allows 
to change the mean of a Gaussian random variable. 



Proposition 26 Let X ~ A/"(0, 1) defined on (Q, A, P) and Q defined by: 



VA e A, Q(A) = E P 



l^exp ( aX 



a 



Q is equivalent to P and X ~ J\f (a, 1) under Q. 



Proof. As E P 



exp (aX — = exp (~f~J Ep [exp (aX)] and exp (aX) 
is lognormal with parameters (0, a) , the properties of the lognormal distrib- 
ution imply that E P [exp (aX)] = exp ( ^ 



We then get Q{Vt) = E P 



1. The fact that the proba- 



exp \ aX — ^j- 

bility measures P and Q are equivalent is obvious because of the exponential 
transformation which takes only strictly positive values. 
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We can write: 

Eq[X] 



XdQ 



X exp ( aX 



a 
~2 



dP 



J xexp Ira — —I jx\x)dx 



271 

1 



xexp ax 



oo 
+00 



a 
~2 



exp 



x 
"2 



xexp 



(x 



a) 



dx 



a 



dx 



where fx is the density of X under P. We observe that X is, under Q, a 
Gaussian variable with expectation a and variance 1. Therefore, the variable 

exp (aX — ^-j is the Radon-Nikodym derivative of Q w.r.t. P, denoted 



In a second step, a more sophisticated transformation may be introduced 
by considering X t ~ Af(nt,a 2 t) with fit = E P (X t ) and oH = V P (X t ). 
Suppose you want to transform X t in a Gaussian variable satisfying rt = 
Eq(X t ) and a 2 t = Vq (X t ) . In economic terms, it means that you start in 
an economy with risk averse agents and you go in another economy with risk 
neutral agents (the expected return on the risky asset is now the risk-free 
rate). 

Rewrite X as: 

X t = fit + aZ t = fit + ay/tY t 

where Z is a standard Brownian motion and Y t ~ Af (0, 1) . To get the desired 
result, you need to transform Y t in a gaussian variable following M (a, 1) with 
a coefficient a to be defined. However, you look for Q such that: 

Eq [X t ] = fit + ay/ia = rt 

It implies that a must be chosen as follows: 

(r - fi)Vi 



a: 



a 



dQ 
dP 



Proposition 26 says that the following transformation is necessary 

2 



exp 



{r-ti)Vt v 1 
a Yt ~2 



r 



exp 



a 1 2 V a 
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We can now present Girsanov theorem which generalizes the transformation 
presented above. 



3.4.2 Girsanov theorem 

Let A = (A*, t G [0; T]) denote a process adapted to T and L = (L t , t G [0; T]) 
defined by: 



L t = exp 



X s dZ s - - I X„ds 



Definition 31 A satisfies the Novikov condition if and only if: 



ex Pi - 



Kds 



< +oo 



Proposition 27 // A satisfies the Novikov condition, we get the following 
properties: 

1) The process L is a P— martingale 

2) The process Z* defined by: 



X.ds 



is a Brownian motion on (O, ^4., J 7 , Q) where Q is characterized by: 

dQ _ 
dF- LT 

It is worth noting that when A is constant, the martingale property for 
L has already been proved in the preceding chapter. In fact, we have in this 
case: 

L t = exp (^-XZ t - 
and we proved it is a martingale. 



3.4.3 Application 

The most common application of Girsanov theorem comes in valuing options 
by the risk-neutral approach. 
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In the Black-Scholes model, the dynamics of the stock price is: 

dS t = [iS t dt + aS t dZ t 

which leads to: 

a 2 



S t = S 0 exp I I n - — jt + crZ t 

The no arbitrage assuption implies that the discounted price process is a 
martingale under the risk-neutral probability 
But the discounted price is: 

exp (-rt) S t = S 0 exp ( \a - r - y ^ t + aZ t 

This process is transformed by Girsanov theorem in: 

( 2 
-y* + aZ t 

with z; = Z t + (^) t. 
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3.5 Stochastic differential equations 

3.5.1 Existence and unicity of solutions 

When building the stochastic integral, we proved that, under some condi- 
tions, a stochastic process is written as a stochastic integral. The symmetri- 
cal question is to know which assumptions on \x and a have to be satisfied for 
a stochastic differential to define a stochastic process (with nice properties!) 
written as a stochastic integral. 

In this section, it is assumed that the filtration T on (Q, A, P) is the 
natural filtration of a standard Brownian motion Z. 



Definition 32 A stochastic differential equation is given by a stochastic dif- 
ferential associated with a boundary condition, that is: 

X 0 = c (3.15) 
dX t = fi (X t , t)dt + a (X t , t) dZ t 



In the general case, c may be a random variable. However, in most 
financial models, c is a constant, for example the initial price of a financial 
asset or the initial short-term rate of interest. 



Definition 33 A stochastic process X is a solution of the equation appearing 
in definition 3.15 on [0;T] if: 

1) X is adapted to T 

2) The functions \i and a satisfy: 

T i>T 

|/i {X t ,t) | dt < +oo and / a 2 (X t , t) dt < +oo (3.16) 

Jo 

3) X satisfies: 

X t = X 0 + f fi(X s ,s)ds+ f a(X 8 ,s)dZ s 
Jo Jo 



The following proposition provides conditions on \i and a for a stochastic 
differential equation to have a solution. 
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Proposition 28 If conditions (a) and (b) hereafter are satisfied, equation 
3.15 has a unique solution (P-a.s), which is a stochastic process X adapted 

to JF, with continuous paths, satisfying E P y J Q T Xfdtj < +00. 

a) There exists m > 0 such that Vi G [0; T] , V (x, y) € M. 2 

max(\fi(x,t) - fi(y,t)\ ; \<r(x,t) - cr(y,t)\) <m\x-y\ 
fi(x, t) 2 + o~(x, t) 2 < m (l + x 2 ) for any pair (x, t) 

b) X 0 is square integrable, independent of Tt for any t. 

A detailed proof of this result may be found in Oksendal (2000), p66. 

The condition appearing in the first part of point (a) is called a Lipschitz 
condition. It limits the slopes of functions \i and a which must be finite and 
bounded by a constant which doesn't depend on t. This condition is standard 
when solving usual differential equations. The second part of condition (a) 
puts some restrictions on the growth of \x and a. As fi is the instantaneous 
expectation of X variations, the condition means that the LHS must be 
of order (1 + x 2 ) 2 . In other words, we cannot have an "exploding" drift. 
If the condition were not satisfied, the drift would grow too rapidly with 
the level reached by the process. For example, it would be the case with 
/i(x, t) = exp(x). 

It is worth noting that the solution provided here is called a "strong 
solution" because the Wiener process Z and the filtration are given. If we 
were solving the problem for the pair (X, Z) , starting from functions \x and 
a, we would speak about "weak solutions" (Karatzas-Shreve, 2000). 

In the following, we always denote X t a solution of equation 3.15 and 
assume that the conditions about existence and unicity of a solution are 
satisfied. 

Proposition 29 1) The solution of equation 3.15 is a Markov process whose 
initial distribution is the same as the one of c. 

2) If fj, and a are continuous functions of t, X is an ltd process with 
parameters fi and a. 
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3.5.2 A specific case: linear equations 

A stochastic differential equation (SDE in the following) is said linear if it is 
written as: 

X 0 = c 

dX t = aX t dt + a t dZ t 

where a is a constant. 

For this type of equation, the solution is written as: 

X t = cexp(at) + / exp [a (t — s)] a s dZ s (3-17) 
Jo 

In fact, equation 3.17 may be transformed in: 

X t exp(— at) = c + / exp (—as) a s dZ s 
Jo 

The process in the RHS, denoted Y, may be written in the following way: 

Y 0 = c 
dY t = exp(—at)a t dZ t 

If we write exp (at) = / (Y t , t) , the partial derivatives of / are given by: 

— = aexp(at)Y t 

— = exp (at) 

*l = 0 
dY? 

Ito's lemma allows to write: 

df (Y t , t) = a exp(at)Y t dt + exp(at) exp(— at)a t dZ t 
= aexp(at)Y t dt + a t dZ t 

Replacing Y t by exp(— at)X t leads to: 

dX t = aX t dt + a t dZ t 
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which is the initial stochastic differential. 

More generally (see Malliaris-Brock (1982)), the following SDE: 



X 0 = c 

dX t = a(t)X t dt + a t dZ t 



has the following solution X: 



Xt = l t 



c+ -f s 1 a s dZ s 
Jo 



where 7 is a solution of the differential equation 

f(t)=a(t)f(t) 

Application 

Let Y an Ornstein-Uhlenbeck process characterized by: 

Y 0 = yo 
dY t = a (P - Y t ) dt + adZ t 

Define X t = (Y t — (3)exp(at) = f(Y t ,t); the partial derivatives of / are 
given by: 



d_l 
dt 
df_ 
dY t 
d 2 f 
dY? 



aexp(at) (Y t — (5) 
exp(at) 

0 



Ito's lemma leads to: 



dX t = [a (Y t — [}) exp(at) + exp(at)a ((3 — Y t )] dt + a exp(at)dZ t 
= aexp(at)dZ t 



As X 0 = yo — (3, it follows: 

X t = yo - (3 + a / exp(as)dZ s 
Jo 
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The relationship between X and Y may be written as: 

Y t = + X t exp(-at) 

Therefore: 

Y t = (3 + exp(— at) y 0 — j3 + a / exp(as)dZ s 

L Jo 

= (5 (1 — exp(— at)) + y 0 exp(— at) + a / exp(— a(t — s))dZ s 

Jo 

Y t follows a Gaussian distribution satisfying: 

E P [Y t \Y 0 =y 0 ] = (3 (1 - exp(-crf)) + y 0 exp(-at) 

Vp[Y t \Y 0 = y 0 ] = a 2 [ ex V (-2a(t - s))ds 

Jo 



a 
2a 



[1 - exp {-2at)\ 



CD 
> 
T3 
CO 

CD 



O 

"5 

CD 
CO 
CC 
CD 



Turning a challenge into a learning curve. 

Just another day at the office for a high performer. 



Accenture Boot Camp - your toughest test yet 

Choose Accenture for a career where the variety of opportunities and challenges allows you to make a 
difference every day. A place where you can develop your potential and grow professionally, working 
alongside talented colleagues. The only place where you can learn from our unrivalled experience, while 
helping our global clients achieve high performance. If this is your idea of a typical working day, then 
Accenture is the place to be. 



It all starts at Boot Camp. It's 48 hours packed with intellectual challenges 



that will stimulate your mind and 
enhance your career prospects. You'l 
spend time with other students, top 
Accenture Consultants and special 
guests. An inspirational two days 



and activities designed to let you 
discover what it really means to be a 
high performer in business. We can't 
tell you everything about Boot Camp, 
but expect a fast-paced, exhilarating 



and intense learning experience. 
It could be your toughest test yet, 
which is exactly what will make it 
your biggest opportunity. 

Find out more and apply online. 



Visit accenture.com/bootcamp 

Consulting • Technology • Outsourcing 



accenture 

High performance. Deliverec 



99 



Download free ebooks at bookboon.com 



Stochastic Processes for Finance 



Bibliography 



Bibliography 



[1] Biingham. N.H. and Kiesel. R. (2001), Risk-Neutral Valuation: Pricing 
and Hedging of Financial Derivatives, Springer Finance. 

[2] Black. F and Scholes.M. (1973), The Pricing of Options and Corporate 
Liabilities, Journal of Political Economy, 81, 637-54. D. and M. Musiela 
(1997) 

[3] Brace. A., Gatarek D. and Musiela M. (1997), The Market Model of 
Interest Rate Dynamics, Mathematical Finance, 7, 127-155. 

[4] Cox. J. and Ross. S. (1976), The Valuation of Option for Alternative 
Stochastic Processes, Journal of Financial Economics 3, 145-166. 

[5] Cox. J., Ross. S. and Rubinstein.M (1979), Option Pricing : a Simplified 
Approach, Journal of Financial Economics 7, 229-263. 

[6] Cox. J., Ingersoll. J. and Ross. S. (1985), A Theory of the Term Structure 
of Interest Rates, Econometrica, 53, 385-407. 

[7] De Bondt. W. and Thaler. R. (1985), Does the Stock market Overreact?, 
Journal of finance, 40, 3, 793-805. 

[8] Dothan. M. (1990), Prices in Financial Markets, Oxford university Press 

[9] Duffie.D (1988), Security Markets, Stochastic Models, Academic Press. 

[10] Fama, E. (1970), Efficient markets : a review of theory and empirical 
work, Journal of finance, 25, 2, 383-417 

[11] Grossman, S.J. (1976), On the Efficiency of Competitive Stock Markets 
when Traders Have Diverse Information, Journal of Economic Theory, 
31, 573-585. 



100 



Download free ebooks at bookboon.com 



Stochastic Processes for Finance 



Bibliography 



[12] Harrison. J. M and Kreps.D.M (1979), Martingales and Arbitrage in 
Multi-period Securities Markets, Journal of Economic Theory, 20, p381- 
408. 

[13] Harrison. J. M and Pliska.S (1981), Martingales and Stochastic Integrals 
in the Theory of Continuous Trading, Stochastic Processes and their 
Applications, 11, 215-60. 

[14] Heath. D., Jarrow R. and Morton A. (1992), Bond pricing and the Term 
Structure of interest Rates: A New methodology, Econometrica, 60, 77- 
105. 

[15] Huang.C.F and Litzenberger.R (1988), Foundations for Financial Eco- 
nomics, North-Holland. 

[16] Hull, J.C. (2009), Options, Futures, and other Derivatives, Pearson, 7th 
ed., Prentice Hall. 

[17] Karatzas. I. and Shreve. S.E.(2000), Brownian Motion and Stochastic 
Calculus, 2nd ed., Springer. 

[18] Koch-Medina.P. and Merino. S. (2003), Mathematical Finance and Prob- 
ability, Birkhauser Verlag. 

[19] Longstaff. F.A. and Schwartz. E.S. (1992), Interest Rate Volatility and 
the Term Structure: A Two Factor General Equilibrium Model, Journal 
of finance, 47, 1259-1282. 

[20] Malkiel, B. (2003), A Random Walk Down Wall Street, W.W. Norton 
& Co. (original edition in 1973). 

[21] Malliaris, A.G. and Brock, W.A. (1988), Stochastic Methods in Eco- 
nomics and Finance, North Holland. 

[22] Markowitz.H (1952), Portfolio Selection, Journal of Finance, 7, 77-91. 

[23] Markowitz.H (1959), Portfolio Selection : Efficient Diversification of 
Investment, Wiley New York. 

[24] Merton, R., C. (1969), Lifetime Portfolio Selection under Uncertainty: 
The Continuous Time Case, Review of Economics and Statistics, 50, 
247-257. 



101 



Download free ebooks at bookboon.com 



Stochastic Processes for Finance 



Bibliography 



[25] Merton, R., C. (1971), Optimal Consumption and Portfolio Rules in a 
Continuous-Time Model, Journal of Economic Theory, 3, 373-413. 

[26] Merton, R., C. (1973), Theory of rational option pricing, Bell Journal 
of Economics and Management Science, 4, 141-83. 

[27] Merton, R. C. (1976), Option Pricing when Underlying Returns are 
Discontinuous, Journal of Financial Economics, 3, 125-144. 

[28] Merton, R. C. (1982), On the Mathematics and Economics Assumptions 
of Continuous-Time Models, in W.F. Sharpe and CM. Cootner, eds, 
Financial Economics: Essays in the Honor of paul Cootner, Englewood 
Cliffs, NJ: Prentice Hall. 

[29] Merton, R. C. (1990), Continuous- Time Finance, Basil Blackwell. 

[30] Mossin.J (1966), Equilibrium in a Capital Asset Market, Econometrica, 
34, 261-76. 

[31] Oksendal. B. (2000), Stochastic Differential Equations: An Introduction 
with Applications, 5th ed., Springer 

[32] Richard. S.F. (1978), An arbitrage model of the term structure of interest 
rates, Journal of Financial Economics, 6, 33-57. 

[33] Roger. P. (2010), Probability for Finance, Ventus Publishing, 
www.bookboon.com. 

[34] Vasicek O. (1977), An Equilibrium Characterization of the Term Struc- 
ture, Journal of Financial Economics, 5, 177-188. 



102 



Download free ebooks at bookboon.com 



Stochastic Processes for Finance 



Index 



Index 

Brownian motion, 57 
general, 63 
geometric, 85 
Markov process, 59 
martingale, 59 
path 

simulation, 65 
stopping time, 67 
transformation, 60 

Doob 

decomposition, 25, 51 
martingale, 24 

Filtration, 9, 44 

complete, 44 

filtered probability space, 44 
natural, 10, 44 
right-continuous, 44 

Girsanov theorem, 89 
application, 89 
Novikov condition, 89 

Ito 

lemma, 84 

application, 84 

Taylor series expansion, 82 
process, 48 

diffusion coefficient, 49 

drift, 49 

Landau notations, 55 

Markov 

chain, 15 

accessible, 17 
aperiodic, 18 

Chapman- Kolmogorov equations, 16 
communicating class, 17 
communication, 17 
homogeneous, 15 
irreducible, 18 
periodicity, 18 



positively recurrent, 19 
recurrent, 19 

stationary distribution, 20 

transient, 19 
no memory process, 48 
process, 15, 48 

Brownian motion, 59 
transition matrix, 15 
Martingale, 21, 49 

Brownian motion, 59 
Doob, 24 

submartingale, 21, 49 
super-martingale, 21, 49 
Modi cation 

stochastic process, 43 

Novikov 

condition, 89 

Path, 40 

cadlag, 41 
caglad, 43 
continuous, 40 

Brownian motion, 58 
LCPvL, 43 

nowhere differentiable, 59 

RCLL, 41 
Poisson 

distribution, 47 

process, 47 
Probability 

transition, 15 

Snell envelope, 36 
Stochastic differential equation, 91 
linear, 93 
solution 

conditions, 92 
de nition, 91 
Markov, 92 
Stochastic integral, 71 

calculation rules, 79 
definition, 76 
properties, 78 
stochastic differential, 78 



103 



Download free ebooks at bookboon.com 



Stochastic Processes for Finance 



Index 



Stochastic process 

adapted, 10, 44 
Brownian motion, 57 
continuous-time, 40 
diffusion, 48 
diffusion coefficient, 49 
discrete-time, 6 
drift, 49 
increments 

independent, 45 

stationary, 45 
indistinguishable, 43 
Ito, 48 

Markov, 15, 48 

modification, 43 

Ornstein-Uhlenbeck, 85 

path, 40 

Poisson, 47 

predictable, 12, 45 

random walk, 22, 46, 5 1 

square root, 86 

stopped process, 33, 37 

trajectory, 40 

Wiener, 57 
Stopping time, 32, 37, 65 

American option, 35 

optional stopping theorem, 32 
Strategy 

doubling, 34 

portfolio, 28 

self- nancing, 28 

Transition 

matrix, 15 
probability, 15 

Wiener process, 57 
general, 63 



104 



Download free ebooks at bookboon.com 



